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RA antigenic peptides 



The present invention provides novel naturally-processed RA antigenic peptides 
which are candidate markers for erosive and non-erosive RA. These antigenic peptides are 
presented by human MHC dass II HLA-DR molecules. Moreover, these antigenic peptides 
linked to MHC dass II molecules, as wdl as antibodies reactive with said antigenic 
5 peptides, nucleic acids encoding said antigenic peptides, and nucleic add constructs and 
host cells for expressing said antigenic peptides are provided- The antigenic peptides of the 
invention as well as the polypeptides they are derived from can be used as markers in 
diagnosis of RA and in tiierapy as aiiti-RA vaccines. 



10 Rheumatoid Arthritis (RA), originally termed chronic polyarthritis, is a systemic 

autoimmune disease and one of the most debilitating forms of articular inflammation 
(Fddmann, M. et al.. Cell 85 (1996) 307-310; Dedhia, H.V. & DiBartolomeo, A., Critical 
care clinics 18 (2002) 841-854). Typically, RA causes joint pain, deformities and severe 
joint stiffriess. The disease can also have its manifestation outside the joints, espedally in 

15 patients who are positive for an autoantibody, termed "rheumatoid factor** (RF) (Mageed, 
RA., in: van Vemooij, W.J. & Maini, R.N. eds.. Manual of biological markers of disease, 
Kluwer Academic Publishers (1996) 1-18). RA occurs quite frequentiy in the Caucasian 
population with the susceptibility to RA being influenced by genetic and environmental 
factors. Both have a crudal effect on the onset and the progression of this autoimmune 

20 disease. Approximately 4% of the total population has an increased genetic susceptibility to 
RA, roughly 20% of which (around 1% of the total population) develops RA as a result of, 
as yet, uncharacterized non-inheritable factors. Beyond that, RA shows a significant bias in 
the sex ratio: women have a three fold higher risk for RA than men, indicating that sex 
hormones may also be involved in the pathogenesis. 

25 In the beginning, RA progresses slowly. Typical early stage symptoms are palm 

sweating, morning stifi&iess of fingers and symmetrical joint inflammation. In addition, 
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rheumatoid nodules can appear which is an indication for tissue affection outside tiie 
joints. In a simplified model, the immrme system produces autoantibodies against healthy- 
tissue. These autoantibodies attack the articular cartilage in the joint leading to its 
inflammation and later on to its destruction. This destruction stimulates tiie immune 
system to produce more autoantibodies. In addition, cytokines like tumor necrosis-factor 
alpha (TNF-a) and Interleukin-l (IL-1) are produced which enhance the inflammatory 
reaction even further (Houssiau, F.A., Clin Rheumatol 14 Suppl 2 (1995) 10-13). The 
synovium begins to swell due to infiltration of additional cells of the immune system, such 
as macrophages and T cells. These cells are actively involved in causing fiirther cell death 
and in driving joint inflammation (Fox, DA., Arthritis Rhexmi 40 (1997) 598-609; Choy, 
E.H. & Panayi, G.S., N Engl J Med 344 (2001) 907-916). This process resembles a vicious 
circle of autoantibody production, joint inflanmiation and joint destruction. 

Typically, RA progresses chronically, with 85-90% of all RA patients showing a mild 
to moderate disease development. Aggressive disease forms leading to complete loss of joint 
fimction up to the degree of invalidity is experienced by 10-15% of the patients. In this 
advanced RA state, patients have a permanent articular inflammation and display 
rheumatoid nodules. They suffer from strong chronical pain and the inflammation leads to 
severe finger stifiGaess and irreversible joint deformations or dislocations. 

Diagnosis 

There is growing evidence that therapeutic intervention early in the disease can 
reduce tiie extent of joint damage (Egsmose, C. et al., J Rheumatol 22 (1995) 2208-2213; 
Van der Heide, A et al., Ann Intern Med 124 (1996) 699-707). Since treatment with 
disease-modifying antirheiunatic drugs (DMARDs) is only justified when the risfcbenefit 
or costrefFectiveness ratios are favorable, it is mandatory to be able to differentiate between 
RA and otiier forms of artiiritis shortiy after onset of tiie disease (Kirwan, J.R. & Quilty, B., 
Clin Exp Rheumatol 15 (1997) 15-25). The diagnosis is made by established criteria based 
on clinical history, physical examination and laboratory tests. The American Society of 
Rheumatism published a catalog of criteria to help gaining objective evidence for RA 
(Arnett, F.C et al.. Arthritis Rheum 31 (1987) 315-324). But so far, not a single test is 
available which is specific for RA. Several biological and biochemical markers, e.g. C- 
reactive protein (CRP), eryflirocyte sedimentation rate (ESR), antinuclear antibody (ANA) 
or RF are utilized for the evaluation of RA. However, tiiese markers are non-specific, as 
tiiey appear in other inflammatory or autoimmune diseases as well. The RF, for instance, is 
an autoantibody that is present in the seram of approximately 50% of RA patients. Since 
increased levels of tiie same autoantibody can also be found in the context of other 
inflammatory diseases, such as Sjogren syndrome, endokarditis or chronical hepatitis, RF is 
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unsuitable to serve as a diagnostic marker for RA. Rather than being of diagnostic value per 
se, the above mentioned biochemical and biological markers are useful for assessing disease 
activity and prognosis as well as in the treatment and management of RA patients. 
(Nakamura, R.M., J Clin Lab Anal 14 (2000) 305-313). 

Recently, a diagnostic set of criteria was developed that consists of clinical and 
biochemical aspects which were claimed to discriminate, at an early state, between self- 
limiting, persistent non-erosive, and persistent erosive RA (Visser, H. et al.. Arthritis 
Rheum 46 (2002) 357-365). Self-limiting arthritis was characterized by natural remission: 
there was no arthritis on examination in a patient for a certain period of time. Erosive 
arthritis was defined based on the presence of erosions on radiographs of the hands and/or 
feet. In particvilar, the use of antibodies recognizing cyclic citruUinated peptides appears to 
be promising and suggests an important role for citruUinated antigens in the early diagnosis 
and prognosis of erosive RA (Schellekens, G.A. et al., J Clin Invest 101 (1998) 273-281; 
Vincent, C. et al., J Rheumatol 25 (1998) 838-846). The early recognition of erosive RA 
allows early intervention with DMARDs, which will lead to earlier disease control and 
improvement of disease outcome (Symmons, D.P.M. et al,, J Rheumatol 25 j(1998.) 1072- 
1077; Anderson, J.J. et al.. Arthritis Rheum 43 (2000) 22-29). Likewise, early recognition of 
self-limiting and non-erosive arthritis will prevent unnecessary treatment with potentially 
toxic therapeutics (Fries, J.F. et al.. Arthritis Rheum 36 (1993) 297-306. 

Therapy 

The goal of any anti-rheiamatic therapy is to relieve pain in order to ease the activities 
of every day life. So far, complete healing of RA is not possible, but by applying modem 
therapies the progression of the disease can be slowed down or even stopped. Due to 
individual differences, each patient requires an individualized therapy and early diagnosis, 
as mentioned before, is desirable. RA therapy is complex and includes lifelong medicinal 
treatment as well as physio- and radiotherapy. DMARDs used in RA therapy are basic 
therapeutics (e.g. Methotrexate, Sulfasalazin, Hydroxychloroquin, Leflunomid, 
Azathioprin), cortisone, non-steroidal anti-inflammatory drugs (NSAID) or monoclonal 
antibodies against the pro-inflammatory cytokines TNF-a, IL-lp or their respective 
receptors (http://rheuma-online.de). These drugs have all in common that they are 
inhibitors of inflammation by suppressing the immtme response. The main disadvantage is 
their lack of specificity for RA, their adverse effects and their inability to effectively target 
the causes of RA. 

Autoimmunity 
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Autoimmunity starts when a specific adaptive immune response is initiated against 
self antigens (autoantigens) manifested by the development of self-reactive T or B ceUs. The 
normal consequence of an adaptive immune response against a foreign antigen is the 
clearance of the antigen firom the body. When an adaptive immune response develops 
against a self antigen, however, the antigen can in most cases not be completely removed 
from the body, leading to a sustained immime response. As a consequence, the effector 
mechanisms of immunity cause chronic inflammatory injury to tissues. The mechanisms of 
tissue damage are essentially the same in autoimmune disease as those that operate in 
protective immunity and in hypersensitivity. Even though it is not well understood what 
triggers autoimmunity, several events which are nowadays believed to contribute to the 
induction of autoimmime diseases and selection of autoantigenic targets have been 
summarized most recently (Marrack, P. et al., Nat Med 7 (2001) 899-905). 

Autoimmune diseases are controlled by properties of particular genes of each 
individual and environmental factor. The host's genes affect the susceptibility to 
autoimmunity at least at three levels. First, some of the genes affect the overall reactivity of 
the immune system and, thus, can predispose the individual to certain or to several 
different types of autoimmune diseases. Second, this altered immunoreactivity is fiinneled 
to particular autoantigens and tissues by other genes that affect recognition of antigenic 
peptides by T cells. Third, still other genes act on the abihty of target tissues to modulate 
immune attack for instance by influencing the activity of effector cells of the immune 
system which are destined to initiate an autoaggressive attack. The latter two sets of genes 
dictate which antigens will be the targets of autoimmunity and hence which organs will be 
attacked and what damage will occur. 

In addition, signals from the environment influence the development of 
autoimmunity at the same three levels, by affecting the overall reactivity of the immune 
system, the antigen-spedficity and the state of the potential target tissue. And finally, there 
is cross-talk between genetic and environmental factors. 

Major histocompatibility complex (MHC) 

Population studies, genotyping and modem approaches at the molecular level have 
imanimously shown that certain genes encoded by the major histocompatibility complex 
(MHC) confer a significantly higher risk for the development of RA (Stastny, P., Tissue 
Antigens 4 (1974) 571-579; Wordsworth, P. et al., PNAS 86 (1989) 10049-10053; 
Wordsworth, P. 8c Bell, J., Springer Semin Immunopathol 14 (1992) 59-78). In particular, 
the dass II MHC alleles HLA-DRBimOh *0401y ''0404 and *0405 in several ethnic groups 
increase the susceptibihty to RA (Reveille, J., Curr Opin Rheumatol 10 (1998) 187-200). 
E.g. more than 90% of RF-positive RA patients carry one of these susceptibility alleles. HLA 
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class II moleciiles are MHC-surface proteins that bind antigenic peptides within the cell 
and present them on the surface of antigen-presenting cells for interaction with the T ceU 
receptors of CD4"^ helper T lymphocytes, thereby initiating a cellular immune response 
(Banchereau, J. & Steinman, R.M., Nature 392 (1998) 245-252). The RA-association of 
particular HLA class 11 molecules together with the presence of large numbers of activated 
CD^'^ T cells in synovial tissue has supported the model of disease induction in which 
disease-associated HLA-DR molecules present disease-relevant (e.g. synovial) autoantigens 
and cause stimulation and expansion of synovial T cells, which then drive the inflammatory . 
process (Striebich, C.C. et al., J Immunol 161 (1998) 4428-4436). 

MHC class II HIA-DR (short: DR) proteins are heterodimers consisting of 
monomorphic a- and extremely polymorphic p-chains that bind peptide antigens in a 
peptide binding groove. This groove generally has fovir major pockets to accept side chains 
at relative positions 1, 4, 6 and 9 of the peptide (Stern, L.J. et al.. Nature 368 (1994) 215- 
221). The allelic variations between HLA class 11 molecules account for the differential 
abiUty to bind antigenic peptides. This is the rationale why individuals differing in their 
HLA alleles have divergent antigenic peptide repertoirs, thereby leading to differences in 
the quality of immune responses (Messaoudi, I. et al.. Science 298 (2002) 1797-1800). 

Peptides bound by class n MHC molecules are typically longer and more 
heterogeneous in size (11-25 amino acids) than the peptides bound by dass I MHC 
molecules (8-10 amino adds). This difference arises because the peptide binding groove of 
dass II proteins is open and while peptides are gripped in the midle, their ends can extend 
out of the groove in a variable fashion (Jones, E.Y., Curr Opin Immunol 9 (1997) 75-79). 
As a consequence, dass II molecules typically bind sets of overlapping peptides that share a 
common core sequence, termed "T cell epitope", but have different lengths. 

More than a decade ago, it was recognized that the DRp chains encoded by RA-linked 
DRBl alleles, although exhibiting polymorphic differences, all share a stretch of identical or 
almost identical amino adds at positions 67-74, known as the "shared epitope" (Gregersen, 
P.K. et al.. Arthritis Rheimti 30 (1987) 1205-1213). Since immunity to autoantigens has 
been regarded central to the pathogenesis of RA, it was hypothesized that the shared 
epitope could impose disease linkage on the respective DR molecules by at least two 
different mechanisms: first, by sdecting the rdevant autoantigenic peptides for 
presentation, and second, by selecting the appropriate autoreactive Tcell spedfidties 
during ontogeny. The three-dimensional structure of DR molecules has indeed revealed 
that the shared epitope is located in the center of the a-hdix flanking one side of the 
peptide binding groove (Stem, L.J. et al.. Nature 368 (1994) 215-221). Thus, strategically 
this shared epitope region is positioned in such a way that it can interact with both bound 
peptide and T cell receptor. 
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However, one of the unresolved mysteries in rheumatology research is the question 
what are the key arthritogenic antigens and epitopes in man that trigger the onset and the 
development of RA. Although autoantibodies of different specificity have been identified in 
serum and synovial fluid of patients it is often imclear whether the antigens which were 
released at the time of cartilage degradation, were initiating pathogenicity or whether they 
are merely a consequence of antigen spreading as a result of inflammation (Corrigall, V.M. 
& Panayi G.S., Grit Rev Immunol 22 (2002) 281-293), Furthermore it is difficult to define 
pathogenic mechanisms in which the antigen is present throughout the body, including the 
joint, but the pathology is targeted solely or predominately to the joint. 

Autoantigens 

The large number of possible autoantigens in RA is derived firom studies using sera 
or, less frequendy, T cells firom patients with estabUshed chronic RA. One of the most 
convincing joint-specific antigen that has been proposed in the context of DR molecules, is 
type II collagen (CII), the predominant protein in articular cartilage. Autoantibodies 
against CII were found in elevated concentrations in the serum and joints of RA patients 
although it is not yet clear whether anti-CII antibodies are pathogenic in RA (Banerjee, S. et 
al., Clin Exp Rheumatol 6 (373-380). Snowden and coworkers have shown that peripheral 
blood T cells from RA patients proliferated to CII, most pronounced in those patients with 
anti-CII antibodies. However, the response was seen only in 50% of patients (Snowden, N. 
et al.. Rheumatology 40 (1997) 1210-1218). In a mouse model immunization with CII was 
shown to induce arthritis in mice expressing the class 11 MHC alleles DRBl *040J and '*^0101 
(Rosloniec, E.F. et al., J Exp Med 185 (1997) 1113-1122; Rosloniec, E.R et al., J inmiunol 
160 (1998) 2573-2578). The immimodominant epitope in both ^^0401 and '^OlOl transgenic 
mice was localized to peptides within residues 261-273 of human CII (Fugger, L. et al., Eur 
J Inununol 26 (1996) 928-933). The same epitope of CII was capable of stimulating a T cell 
response in RA patients, particularly in the early stages of disease. Synovial fluid T cells 
were especially responsive (Kim, H.Y. et al.. Arthritis Rheum 42 (1999) 2085-2093). 

Although other cartilage proteins have been proposed as RA candidate antigens, 
DR4-binding epitopes have been defined only for human cartilage glycoprotein 39 
(HCgp39). This protein is secreted by synovial cells and articular chondrocytes and its 
expression is upregulated in plasma and joints dxiring infhunmation (Vos, K. et al., Ann 
Rheum Dis 59 (2000) 544-548). Similar to CII, HCgp39 treatment induces arthritis in 
mice. In addition a HCgp39 response of peripheral blood T cells firom RA patients was 
detected (Verheijden, G.F. et al.. Arthritis Rheum 40 (1997) 1115-1125). The predominant 
epitope recognized by T cells in DR4 patients was defined between residues 263-275 and 
identical to flie immunodominant epitope found in Di?Bl*040 J -transgenic mice after 
immmiization with native HCgp39 (Cope, A.P. et al.. Arthritis Rhexmi 42 (1999) 1497- 
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1507). Although not disease specific, responses to this peptide did correlate with disease 
activity in RA patients (Baeten, D. et al.. Arthritis Rheum 43 (2000) 1233-1243). Antibodies 
to HCgp39, however, have also been detected in the sera of patients with inflammatory 
diseases, such as inflammatory bowel disease and systemic lupus erythematosus (SLE), 
albeit at a lower level than in RA. 

In an attempt to track antigen-specific T cells in RA, soluble peptide-DR4 tetrameric 
complexes were used to detect synovial CD4"*' T cells reactive with CII or HCgp39 in DR4"*' 
patients (Kotzin, B.L. et al., PNAS 97 (2000) 291-296). The CII-DR4 complex botmd in a 
specific manner to CII peptide-reactive T cell hybridomas, but did not stain a detectable 
firaction of synovial CD4'^ cells. Almost similar results were obtained with the HCgp39-DR4 
complex suggesting that the major oUgoclonal 004"*" T cell expansions present in RA joints 
are not specific for the dominant CII and HCgp39 determinants described above. 

In summary, despite some strong indications for a CII and HCgp39 association with 
RA, the evidence that they are important antigens in RA is scanty. A direct proof that 
peptides of CII or HCgp39 are presented in a class II MHC-restricted manner by antigen- 
presenting cells with subsequent stimulation and activation of synovial CD4'*" T cells is still 
lacking. Furthermore a major problem of animal models is their xmknown relevance to RA 
as Cn-induced arthritis by immimizing rats or mice differs in many respects from RA. 

Naturally processed MHC class Il-associated peptides 

An alternative strategy to the identification of RA-specific autoantibodies and T cells 
relies on the sequence analysis of naturally processed peptide antigens bound to MHC dass 
II molecules. With the help of monoclonal antibodies, dass n MHC molecules conferring 
susceptibility to RA can be purified firom cognate cells. RA-assodated peptide antigens can 
be add-duted firom purified HLA dass II molecules. The mixture of small peptides can be 
separated by HPLC and the peptide sequence be determined by Edman sequencing or mass 
spectrometry. Due to hmitations with peptide purification and sequencing techniques, 
peptide sequences were, as yet, only obtained firom MHC molecules that have been isolated 
fi^om cvdtured B ceQ lines or large amounts of tissue, and the analysis was restricted to a few 
abundant peptides (Kropshofer et al., J.Exp.Med. 175 (1992) 1799-1803; Chicz, R.M. et al., 
J Exp Med 178 (1993) 27-47). As a result of the development of high-resolution 
microcapiUary HPLC coltmms and more sensitive mass spectrometers, MHC-boimd 
peptides can be analyzed more efficiently (Dongre, A.R. et al., Eur J Immunol 31 (2001) 
1485-1494; Engelhard, V.H. et al., Mol Immunol 39 (2002) 127-137). 

In the present invention a modified peptide isolation and sequencing tedmique was 
used to investigate the peptide antigen repertoire of HLA-DR4 molecules derived firom 
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autologous dendritic cells (DCs) which were pulsed with serum or synovial fluid derived 
from RA patients. The main advantage of this innovative approach is the usage of hiiman 
DCs that are professionals in RA-relevant antigen processing and presentation, instead of 
using transgenic animal models or artificial B cell lines. 

DCs are enriched in rheumatoid synovial fluid and tissue and are derived from 
circulating immature precursors (Thomas, R. et al., J Immimol 152 (1994) 2613-2623). 
They are the most potent antigen-presenting cells which express high levels of MHC 
molecules together with a variety of accessory molecules (Mellman, L et al.. Trends Cell 
Biol 8 (1998) 231-237). In a most recent study, it was shown that ex vivo differentiated 
hiunan DCs and macrophages that are phenotypicaUy similar to antigen-presenting cells 
from RA synovial joints, were capable of generating and presenting unmunodominant 
epitopes from CH and HCgp39 (Tsark, E.C. et al., J Inmtiunol 169 (2002) 6625-6633). DC 
have the capacity to prime CD4'*' helper T cells and to effectively activate cytotoxic CDS"^ 
T cells (Ridge, T. et al.. Nature 393 (1998) 474-478). Thus, peptides bound to MHC class II 
molecules and presented by DCs play a superior role in the pathogenesis of diseases 
involving T cell-driven immune responses. 

Therefore, the problem posed by the lack of knowledge of MHC dass II restricted 
antigenic peptides for RA is solved by providing novel naturally-processed MHC dass n 
associated RA antigenic peptides and the polypeptides they are derived from as markers for 
RA. 

The present invention provides novel naturally-processed antigenic peptides which are 
candidate RA markers in erosive and non-erosive RA. These antigenic peptides are 
presented by human MHC class II HLA-DR molecules derived from dendritic cells which 
were pulsed with serum or synovial fluid derived from patients with established erosive or 
non-erosive RA. The MHC class II antigenic peptide of the invention are comprising (a) at 
least the amino acid sequence of the peptide binding motif sdected from the group 
consisting of SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 122, or (b) at least the amino 
acid sequence of the peptide binding motif sdected from the group consisting of SEQ ID 
NOs. 49 to 57 and SEQ ID NOs. 103 to 122 with additional N-and C-terminal flanking 
sequences of a corresponding sequence sdected from the group consisting of SEQ ID NOs. 
1 to 39 or SEQ ID NOs. 58 to 102, and originate from interferon-y-inducible lysosomal 
thiol reductase, integrin beta-2, phosphatitylinositol-4,5-bispho5phate 3-kinase, urokinase- 
type plasminogen activator, immunoglobulin heavy chain V-III region (Vh26), DJ-1 
protein, apolipoprotein B-lOO, 26S proteasome non-ATPase regulatory subunit 8, 
interleukin-1 receptor, fibromodulin, GM-CSF/IL-3/IL-5 receptor, sorting nexin 3, inter-a- 
trypsin inhibitor heavy chain H4, complement C4, complement C3 (a-chain), complement 
C3 (P-chain), SH3 domain-binding glutamic add-rich-like protein 3, interleukin-4- 
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induced protein 1, hemopexin, Hsc70-interacting protein, invariant chain (li), retinoic acid 
receptor responder protein 2, fibronectin, cathepsin B, tripeptidyl-peptidase II, legumain, 
platelet activating factor receptor, poly-alpha-2.8-sial^transferase, and ras-leated protein 
Rab-1 IB. The present invention also provides these antigenic peptides and the proteins 
they are derived from as markers for erosive and/or non-erosive RA. Moreover, these 
antigenic peptides linked to MHC class II molecules, as well as antibodies reactive with said 
antigenic peptides, nucleic acids encoding said antigenic peptides, and nucleic add 
constructs, host cells and methods for expressing said antigenic peptides are provided. 
Further methods are provided for isolating and identifying RA antigenic peptides. 

Fig. 1: Diagram of Dendritic cell (DC)-mediated analysis of tissue samples: 
Dendritic cells (DCs), the most speciaUzed antigen-presenting cells (APCs), are brought in 
contact with an antigen source (e.g. synovial fluid) imder optimal conditions for antigen 
uptake and antigen processing. As a control, DCs are cultured under the same conditions 
in the absence of synovial fluid antigens. After maturation of DCs, antigen-loaded MHC 
class II molecules are purified and the respective MHC dass Il-associated antigenic 
peptides are isolated and identified. 

Fig. 2A: ION-TRAP MS Base Peak Chromatogram of MHC class Il-associated 
antigenic peptides that were isolated firom dendritic cells pulsed with the serum of a RA 
patient. The peptides were eluted directly firom a RP-C18-HPLC column into the ion trap 
mass spectrometer for unmediate MS/MS identification. The numbers indicate the 
retention times (upper value) and the molecular masses (lower value) of the most 
prominent peptide peaks in the mixture at the respective time. 

Fig. 2B: ION-TRAP MS spectrum of antigenic peptides at a retention time of 
65.4 min. The marked peak was further firagmented and corresponded to a doubly charged 
peptide ion firom the inter-alpha-trypsin inhibitor ITIH4 (cf. table 3). 

Fig. 2C: ION-TRAP MS/MS spectrum of the doubly charged peptide ion at m/z 
977.1. The firagmentation masses, together with the mass of the parent ion, were searched 
against a non-redundant human database by using the SEQUEST algorithm. The retrieved 
sequence MPKNWFVIDKSGSMSGR (one-letter-code) corresponded to the dominant 
epitope rnH4 (271-288) of the inter-alpha-trypsin inhibitor. The positions of the assigned 
series of N-terminal B-ions and C- terminal Y-ions are marked. 

Fig. 3: Summary of the differential binding capacity of the tested candidate RA 
antigens in the context of binding to the HLA-DRBl'^0401 allele. The putative HLA- 
DRB1'*'0401 binding motif is boxed in grey. and. As a measure for aflSnity, the peptide 
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concentration was determined that was needed to reduce binding of a fixed amount of 
biotinylated HA(307-319) peptide by 50% (IC50) through competition. The reciprocal (1/ 
IC50) direcdy correlates with peptide afi&nity: As a reporter biotinylated HA(307-319) 
peptide from influenza hemagglutinin (Rothbard, J.B. et al.. Cell 52 (1988) 515-523) was 
5 included in the study. 



The antigenic peptides of the invention are peptides, which are associated with and 
presented by MHC molecules and thereby can have the potential to activate or tolerize 
10 T cells. Antigenic peptides presented by MHC class 11 molecules are therefore MHC dass II 
eissociated or MHC class II antigenic peptides, whereas antigenic peptides presented by 
MHC dass I molecules are MHC class I associated or MHC dass I antigenic peptides. 

Peptides which are derived firom proteins that are encoded in the genome of the body 
or an APC are denoted as "sdf-peptides". The main function of self-peptides presented by. 
15 DCs in the peripheral lymphoid organs is thought to be the induction of T cell tolerance to 
self-protdns. Tolerance is the failure to respond to an antigen; when that antigen is borne 
by sdf tissues, tolerance is called self tolerance. 

Antigens which are derived firom an individual's own body are called "self antigens" 
or "autoantigens". An adaptive immune response directed against self antigens is called an 

20 autoinunune response. Likewise, adaptive immtmity specific for self antigens is called 
autoinmiunity. Autoreactivitjr describes immune responses directed against self antigens. 
RA is probably due to an autoimmune response that is based on the involvement of 
autoreactive T cells and/ or autoreactive antibodies. Immunogenic peptide indudes, but is 
not limited to, an antigenic peptide capable of causing or stimulating a cellular or humoral 

25 inunune response- Such peptides may also be reactive with antibodies. 

Peptides derived firom proteins encoded in the genome of bacteria, viruses or other 
foreign invaders and which differ firom self-proteins are called "fordgn antigenic" or 
"fordgn" peptides. They are able to didt a T cell response against fordgn proteins they are 
derived firom. 

30 RA antigenic peptides are self-peptides that function as self antigens and as a 

consequence of the disesise erroneously trigger autoreactivity against self tissues. 
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The present invention provides a MHC class 11 antigenic peptide comprising (a) at 
least the amino add sequence of the peptide binding motif selected from the group 
consisting of SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 122, or (b) at least the amino 
acid sequence of the peptide binding motif selected from the group consisting of SEQ ID 
5 NOs. 49 to 57 and SEQ ID NOs. 103 to 122, with additional N-and C-terminal flanking 
sequences of a corresponding sequence selected from the group consisting of SEQ ID NOs. 
1 to 39 and SEQ ID NOs. 58 to 102. Preferably, the MHC class II antigenic peptide has a 
length of less than 26 amino acids, more preferably a length of 11 to 25 amino acids. Even 
more preferred is the antigenic peptide of the invention with a length of 11 to 19 amino 
10 acids. Most preferred is the antigenic peptide of the invention consisting of the peptide 
binding motif comprising the fovir anchor amino acids. 

The present invention also provides a MHC class II antigenic peptide comprising (a) 
at least the amino acid sequence of the peptide binding motif of SEQ ID NO. 49, or (b) at 
least the amino acid sequence of the peptide binding motif of SEQ ID NO. 49 with 
15 additional N-cind C-terminal flanking sequences of a corresponding sequence selected from 
the group consisting of SEQ ID NOs. 1 to 3. . 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino add sequence of the peptide binding motif of SEQ ID NO. 103, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 103 with additional N- 
20 and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 58 and 
59. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 104, or (b) at least the 
amino add sequence of the peptide binding motif of SEQ ID NO. 104 with additional N- 
25 and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 60. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 105, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 105 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 61, 

30 Furthermore, a MHC class 11 antigenic peptide is provided comprising (a) at least the 

amino add sequence of the peptide binding motif of SEQ ID NO. 106, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 106 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 62. 

Furthermore, a MHC dass 11 antigenic peptide is provided comprising (a) at least the 
35 amino add sequence of the peptide binding motif of SEQ ID NO. 107, or (b) at least the 
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amino acid sequence of the peptide binding motif of SEQ ID NO. 107 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 63. 

Furthermore, a MHC class n antigenic peptide is provided comprising (a) at least the 
amino add sequence of the peptide binding motif of SEQ ID NO. 50, or (b) at least the 
5 amino add sequence of the peptide binding motif of SEQ ID NO. 50 with additional N-and 
C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 5. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 108, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 108 with additional N- 
10 and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 64 to 
67. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino add sequence of the peptide binding motif of SEQ ID NO. 109, or (b) at least the 
amino add sequence of tiie peptide binding motif of SEQ ID NO. 109 with additional N- 
15 and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 68. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino add sequence of the peptide binding motif of SEQ ID NO. 110, or (b) at least the 
amino add sequence of the peptide binding motif of SEQ ID NO. 110 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 69 and 
20 70. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. Ill, or (b) at least the 
an:mio add sequence of the peptide binding motif of SEQ ID NO. Ill with additional N- 
and C-tern^nal flanking sequences of the corresponding sequence of SEQ ID NO. 72. 

25 Furthermore, a MHC class 11 antigenic peptide is provided comprising (a) at least the 

amino acid sequence of the peptide binding motif of SEQ ID NO. 112, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 112 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 73. 

The MHC class II associated novel antigenic peptides of the invention originate from 
30 interferon-y-inducible lysosomal thiol reductase (SEQ ID NOs. 1 to 3), integrin beta-2 
(SEQ ID NOs. 58 and 59), phosphatityUnositol-4,5-bisphosphate 3-kinase (SEQ ID NO: 
60), urokinase-type plasminogen activator (SEQ ID NO: 61), inmixmo^obulin heavy chain 
V-m region (Vh26) (SEQ ID NO: 62), DM protein (SEQ ID NO: 63), apoUpoprotein B- 
100 (SEQ ID NOs. 4 and 5), 26S proteasome non-ATPase regulatory subimit 8 (SEQ ID 
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NOs. 64 to 67), interlexikin-l receptor (SEQ ID NO: 68), fibromodulin (SEQ ID NOs.: 69 
and 70), GM-CSF/IL-3/IL-5 receptor (SEQ ID NOs. 71 and 72), sorting nexin 3 (SEQ ID 
NO: 73), inter-a-trypsin inhibitor heavy chain H4 (SEQ ID NOs. 6 to 12), complement C4 
(SEQ ID NOs. 13 to 18), complement C3 (a-chain) (SEQ ID NOs. 19 to 23, 74 and 75), 
complement C3 ((J-chain) (SEQ ID NOs. 76 and 77), SH3 domain-binding glutamic acid- 
rich-like protein 3 (SEQ ID NOs. 24 to 27), interleukin-4-induced protem 1 (SEQ ID NOs. 
28 to 30), hemopexin (SEQ ID NOs. 31 to 35 and 78), Hsc70-interacting protein (SEQ ID 
NOs. 36 to 39), invariant chain (li) (SEQ ID NOs. 79 to 83), retinoic acid receptor 
responder protein 2 (SEQ ID NOs. 84 to 86), fibronectin (SEQ ID NOs. 87 to 91), 
cathepsin B (SEQ ID NO: 92), tripeptid^d-peptidase 11 (SEQ ID NOs. 93 and 94), legumain 
(SEQ ID NO: 95), platelet activating factor receptor (SEQ ID NO: 96), poly-alpha-2.8- 
sialyltransferase (SEQ ID NO: 97), and ras-leated protein Rab-llB (SEQ ID NOs. 98 to 
102). 

The single peptide binding groove of MUC class II molecules is about 25 A long, but 
in contrast to MHC class I molecules, both sides are open (Stern LJ et al.. Nature 1994; 368, 
215-221). Thus, naturally processed antigenic peptides eluted from human MHC class 11 
molecules have a minimal length of about 11 residues and attain a maximal length of about 
25 residues (Chicz RM et al., J Exp Med 1993; 178, 27-47). 

The stabiUty of the MHC-peptide interaction is determined by more than a dozen 
hydrogen bonds involving the peptide backbone and the complementarity between 
specificity pockets of the binding groove and appropriately located amino acid side-chains 
of the peptide. The amino acids of the peptide fitting into the respective pockets were 
named "anchor" residues. With regard to most HLA-DR alleles, these anchors are located at 
relative positions PI, P4, P6 and P9. The combination of amino acids at these 4 anchor 
positions conferring high-stability binding to the respective HLA-DR allelic product and 
vary firom allele to allele. The peptide binding motif is defined herein as the sequence of 
nine amino acids comprising the four anchor amino acids. The peptide binding motif of 
the MHC class II antigenic peptide of the invention is depicted in SEQ ID NO. 49 for the 
peptides derived firom interferon-y-inducible lysosomal thiol reductase (SEQ ID NOs. 1 to 
3), in SEQ ID NO. 103 for the peptides derived from integrin beta-2 (SEQ ID NOs. 58 and 
59), in SEQ ID NO. 104 for the peptides derived from phosphatitylinositol-4,5- 
bisphosphate 3-kinase (SEQ ID NO: 60), in SEQ ID NO. 105 for the peptides derived from 
urokinase-type plasminogen activator (SEQ ID NO: 61), in SEQ ID NO. 106 for the 
peptides derived from immunoglobulin heavy chain V-IH region (Vh26) (SEQ ID NO: 62), 
m SEQ ID NO. 107 for the peptides derived from DJ-1 protein (SEQ ID NO: 63), in SEQ 
ID NO. 50 for the peptides derived from apolipoprotein B-lOO (SEQ ID NOs. 4 and 5), in 
SEQ ID NO. 108 for the peptides derived from 26S proteasome non-ATPase regulatory 
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subunit 8 (SEQ ID NOs. 64 to 67), in SEQ ID NO. 109 for the peptides derived from 
interleuldn-1 receptor (SEQ ID NO: 68), in SEQ ID NO. 110 for the peptides derived from 
fibromoduhn (SEQ ID NOs.: 69 and 70), in SEQ ID NO. Ill for the peptides derived from 
GM-CSF/IL-3/IL-5 receptor (SEQ ID NOs. 71 and 72), in SEQ ID NO. 112 for the peptides 
derived from sorting nexin 3 (SEQ ID NO: 73), in SEQ ID NO. 51 for the peptides derived 
from inter-a-trypsin inhibitor heavy chain H4 (SEQ ID NOs. 6 to 12), in SEQ ID NO. 52 
for the peptides derived from complement C4 (SEQ ID NOs. 13 to 18), in SEQ ID NO. 53 
for the peptides derived from complement C3 (a-chain) (SEQ ID NOs. 19 to 23, 74 and 
75), in SEQ ID NO. 113 for the peptides derived from complement C3 (P-diain) (SEQ ID 
NOs. 76 and 77), in SEQ ID NO. 54 for the pq>tides derived from SH3 domain-binding 
glutamic acid-rich-like protein 3 (SEQ ID NOs. 24 to 27), in SEQ ID NO. 55 for the 
peptides derived from interIeukin-4-induced protein 1 (SEQ ID NOs. 28 to 30), in SEQ ID 
NO. 56 for the peptides derived from hemopexin (SEQ ID NOs. 31 to 35 and 78), in SEQ 
ID NO. 57 for the peptides derived from Hsc70-interacting protein (SEQ ID NOs. 36 to 
39), in SEQ ID NO. 114 for the peptides derived from invariant chain (li) (SEQ ID NOs. 79 
to 83), in SEQ ID NO. 115 for the peptides derived from retuioic acid receptor responder 
protein 2 (SEQ ID NOs. 84 to 86), in SEQ ID NO. 116 for the peptides derived from 
fibronectin (SEQ ID NOs. 87 to 91), in SEQ ID NO. 117 for the peptides derived from 
cathepsin B (SEQ ID NO: 92), in SEQ ID NO. 118 for the peptides derived from 
tripeptidyl-peptidase II (SEQ ID NOs.93 and 94), in SEQ ID NO. 119 for the peptides 
derived from legumain (SEQ ID NO: 95), in SEQ ID NO. 120 for the peptides derived from 
platelet activating factor receptor (SEQ ID NO: 96), in SEQ ID NO. 121 for the peptides 
derived from poly-alpha-2.8-sialyltransferase (SEQ ID NO: 97) and in SEQ ID NO. 122 for 
the peptides derived from Ras-related protein Rab-llB (SEQ ID NO: 98 to 102). 

The peptide binding motif may also comprise at least one, at least two, at least three, 
at least four or at least five modifications of the amino add sequence while still attaining 
the binding capadty of the non-modified peptide binding motif. Preferably, the modified 
peptide binding motif comprises at least three of the four anchor amino adds of the non- 
modified peptide binding motif. The amino add modification may be a conservative 
amino add substitution as described below. 

Additional binding energy is provided by hydrogen bonds involving residues in front 
of the PI anchor and behind the P9 anchor. In agreement with that, in most naturally 
processed peptides the nonameric core-region (P1-P9) is N- and C-terminally flanked by 
3-4 residues. Hence, the majority of peptides are 15-17-mers. Longer peptides protrude 
from the groove, thereby allowing access of exopeptidases which are trimming both ends. 

Therefore, the MHC dass II antigenic peptide of the invention comprising (a) at least 
the amino add sequence of the peptide binding motif selected from Ihe group consisting of 
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SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 122, or (b) at least the amino acid sequence 
of the peptide binding motif selected from the group consisting of SEQ ID NOs. 49 to 57 
and SEQ ID NOs. 103 to 122 with additional N- and C- terminal flanking sequences of a 
corresponding sequence selected from the group consisting of SEQ ID NOs. 1 to 39 and 
SEQ ID NOs. 58 to 102, preferably comprises additional N- and C- terminal flanking 
amino acid residues providing additional binding energy. 

Preferably, the MHC class II antigenic peptide of the present invention has a binding 
capacity to the corresponding MHC class II molecule of between one tenth and ten-fold the 
IC50 of a corresponding peptide selected from the group consisting of SEQ ID NOs. 1 to 39 
and SEQ. ID NOs. 58 to 102. The binding capacity of a peptide is measured by determining 
the concentration necessary to reduce binding of a labelled reporter peptide by 50%. This 
value is called IC50. A MHC class 11 antigenic peptide of the invention maintains its binding 
capacity to the relevant HLA class II molectiles as long as it attains IC50 values between one 
tenth and 10-fold the IC50 of the established reference peptides. 

Since peptide trimming occurs in an individual fashion both before and after binding 
into the peptide binding groove, the occurrence of several truncation variants sharing a 
common nonameric core region is a common feature of MHC class Il-bound peptides. 
Importantly, it was shown that C- or N-terminal truncation variants of the same epitope 
can trigger divergent T cell responses (Arnold et al., (2002) J. InununoL 169, 739-749). 

Several parameters can be envisaged that have an influence on the relative abundance 
of truncation variants of a particular epitope, e.g. the abundance and integrity of the 
antigen of relevance, antigen-associated proteins, the abundance of proteases, the type of 
proteases available and the supply with competitive antigens and/or peptides- Since the 
antigen supply is a major characteristic that may correlate with the origin of a sample, the 
ratio of particular truncation variants of an epitope can be of diagnostic value. 

A peptide of the invention is a peptide which either has no naturally-occurring 
counterpart (e.g., such as an mutated peptide antigen), or has been isolated, i.e., separated 
or purified from components which naturally accompany it, e.g., in tissues such as 
pancreas, Uver, spleen, ovary, testis, muscle, joint tissue, neural tissue, gastrointestinal 
tissue, or body fluids sudhi as blood, serum, synovial fluid or urine. Typically, the peptide is 
considered "isolated" when a preparation comprising a peptide of the invention consists to 
at least 70%, by dry weight of said peptide and to less than 30% of the proteins and 
naturally-occurring organic molecules with which it is naturally associated. Preferably, a 
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preparation of a peptide of the invention consists of at least 80%, more preferably at least 
90%, and most preferably at least 99%, by dry weight, the peptide of the invention. Since a 
peptide that is chemically synthesized is, by its nature, separated from the components that 
naturally accompany it, the synthetic peptide is "isolated", 

5 

The invention further provides analogs of the antigenic peptide of the invention. The 
term analog includes any peptide which displays the functional aspects of these antigenic 
peptides comprising the binding capacity IC50 and the recognition by antibodies and cells 
of the immune system. Analogs exhibit essentially the same IC50 as the corresponding 
10 reference peptide. The term analog also includes conservative substitutions or chemical 
derivatives of the peptides. 

The term "analog" includes any polypeptide having an amino acid residue sequence 
substantially identical to the sequences described herein in which one or more residues 
have been conservatively substituted with a functionally similar residue and which displays 

15 the functional aspects of the peptides as described herein. Examples of conservative 
substitutions include the substitution of one non-polar (hydrophobic) residue such as 
phenylalanine, tyrosine, isoleucine, valine, leucine or methionine for another, the 
substitution of one polar (hydrophihc) residue for another such as between arginine and 
lysine, between glutamine and asparagine, between threonine and serine, the substitution 

20 of one basic residue such as lysine, arginine or histidine for another, or the substitution of 
one acidic residue, such as aspartic acid or glutamic acid for another. 

The phrase "conservative substitution" also includes the use of a chemically 
derivatized amino acid in place of a non-derivatized amino acid. "Chemical derivative" 
refers to a subject polypeptide having one or more amino adds chemically derivatized by 

25 reaction of a functional side group. Examples of such derivatized molecules include for 
example, those molecules in which free amino groups have been derivatized to form amine 
hydrochlorides, p-toluene sulfonyl groups, carbobenzoxy groups, t-butyloxycarbonyl 
groups, chloroacetyl groups, acetyl groups or formyl groups. Free carboxyl groups may be 
derivatized to form salts, methyl and ethyl esters or other types of esters or hydrazides. Free 

30 hydroxyl groups may be derivatized to form O-acyl or O-alkyl derivatives. The imidazole 
nitrogen of histidine may be derivatized to form N-im-benzylhistidine. Also included as 
chemical derivatives are those proteins or peptides, which contain one or more naturally- 
occurring amino acid derivative of the twenty standard amino acids. For examples: 4- 
hydroxyproline may be substituted for proline; 5-hydroxylysine may be substituted for 

35 lysine; 3-methylhistidine may be substituted for histidine; homoserine may be substituted 
for serine; and ornithine or citrulline maybe substituted for lysine. 
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The MHC class II antigenic peptides of the invention and the proteins they are 
derived from can be used as markers in diagnosis of RA and in therapy as anti-RA vaccines. 
The term marker as used herein refers to a biomolecule, preferably a peptide or a 
5 polypeptide, which is expressed in a group of patients with a diagnosed disease, e.g. RA, 
and attains an abimdance that is significantiy increased or decreased as compared to a 
control group. 

The marker of the present invention may be used as a prognostic marker to predict 
the susceptibility to a disease, e.g., to predict the susceptibility to RA, as a diagnostic marker 
10 for the diagnosis of a disease, e.g. for the diagnosis of RA, as a differential diagnostic marker 
to differentiate between different forms of a disease, e.g., to differentiate between different 
forms of RA, as a prognostic marker for the prediction of the outcome of a disease, e.g., for 
the prognosis of RA, and as a response marker to determine the efi&cacy of a therapeutic 
regime, e.g., as a response marker in the treatment of RA. 

15 

In a further embodiment, the MHC class II antigenic peptide comprising (a) at least 
the amino acid sequence of the peptide binding motif selected from the group consisting of 
SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 122, or (b) at least the amino acid 
sequence of the peptide binding motif selected from the group consisting of SEQ ID NOs. 
20 49 to 57 and SEQ ID NOs. 103 to 122, with additional N-and C-terminal flanking 
sequences of a corresponding sequence selected from the group consisting of SEQ ID NOs. 
1 to 39 and SEQ ID NOs. 58 to 102, is used as a marker for erosive and/or non-erosive RA. 

In a frirther embodiment, the MHC class II antigenic peptide comprising (a) at least 
the amino acid sequence of the peptide binding motif of SEQ ID NO. 49, or (b) at least the 
25 amino acid sequence of the peptide binding motif of SEQ ID NO. 49 with additional N-and 
C-terminal flanking sequences of a corresponding sequence selected from the group 
consisting of SEQ ID NOs. 1 to 3 is used as a marker for non-erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 103, or (b) at least the 
30 amino acid sequence of the peptide binding motif of SEQ ID NO. 103 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 58 and 
59 is used as a marker for non-erosive RA. 

Furthermore, a MHC class 11 antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 104, or (b) at least the 
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amino acid sequence of the peptide binding motif of SEQ ID NO. 104 with additional N- 
and C-terihinal flanking sequences of the corresponding sequence of SEQ ID NO. 60 is 
used as a marker for non-erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
5 amino add sequence of the peptide binding motif of SEQ ID NO. 105, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 105 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 61 is 
used as a marker for non-erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
10 amino acid sequence of the peptide binding motif of SEQ ID NO. 106, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 106 with additional N- 
and C-tenninal flanking sequences of the corresponding sequence of SEQ ID NO. 62 is 
used as a marker for non-erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
15 amino acid sequence of the peptide binding motif of SEQ ID NO. 107, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 107 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 63 is 
used as a marker for non-erosive RA. 

In a further embodiment, the MHC class 11 antigenic peptide comprising (a) at least 
20 the amino add sequence of the peptide binding motif of SEQ ID NO. 50, or (b) at least the 
amino add sequence of the peptide binding motif of SEQ ID NO. 50 with additional N-and 
C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 5 is used as a 
marker for erosive RA. 

Furthermore, a MHC dass II antigenic peptide is provided comprising (a) at least the 
25 amino add sequence of the peptide binding motif of SEQ ID NO, 108, or (b) at least the 
amino add sequence of the peptide binding motif of SEQ ID NO. 108 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 64 to 67 
is used as a marker for erosive RA. 

Furthermore, a MHC class 11 antigenic peptide is provided comprising (a) at least the 
30 amino add sequence of the peptide binding motif of SEQ ID NO. 109, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 109 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 68 is 
used as a marker for erosive RA. 



% wo 2005/014622 



19 



PCT/EP2004/008609 



Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 110, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 110 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NOs. 69 and 
5 70 is used as a marker for erosive RA. 

Furthermore, a MHC class 11 antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. Ill, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. Ill with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 72 is 
10 used as a marker for erosive RA. 

Furthermore, a MHC class II antigenic peptide is provided comprising (a) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 112, or (b) at least the 
amino acid sequence of the peptide binding motif of SEQ ID NO. 112 with additional N- 
and C-terminal flanking sequences of the corresponding sequence of SEQ ID NO. 73 is 
15 used as a marker for erosive RA. 



In a further embodiment, the MHC class II antigenic peptides of the invention as 
20 described above are provided linked to a MHC class II molecule. 

Multimers (e.g., dimers, trimers, tetramers, pentamers, hexamers or oligomers) of a 
dass II MHC molecule containing a covalently or non-covalently bound peptide according 
to the present invention, if conjugated with a detectable label (e.g., a fluorescent moiety, a 
radionuclide, or an enzyme that catalyzes a reaction resulting in a product that absorbs or 

25 emits Ught of a defined wavdength) can be used to quantify T cdls firom a subject (e.g., a 
human patient) bearing cell surfiice receptors that are specific for, and therefore will bind, 
such complexes. Relativdy hi^ numbers of such T cells are likely to be diagnostic of 
disease or an indication that the T cells are involved in immunity to the disease. In 
addition, continuous monitoring of the relative numbers of multimer-binding T cdls can 

30 be usefiil in establishing the course of a disease or the efficacy of therapy. Such assays have 
been developed using tetramers of class I MHC molecules containing an HTV-l-derived or 
an influenza virus-15 derived peptide (Altman et al. (1996), Science 274:94-96; et al. 
(1998), Science 279:2103- 21061), and corresponding dass n MHC multimers would be 
expected to be similarly useful. Such complexes could be produced by chemical cross- 

35 linking of purified dass 11 MHC molecules assembled in the presence of a peptide of 
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interest or by modification of already established recombinant techniques for the 
production of dass II MHC molecules containing a single defined peptide (Kazono et al. 
(1994), Nature 369:151-154; Gauthier et al. (1998), Proc. Natl. Acad. Sci. U.S.A. 95:11828- 
118331). The class 11 MHC molecule monomers of such multimers can be native molecules 
composed of full-length alpha and beta chains. Alternatively, they can be molecules 
containing either the extracellular domains of the alpha and beta chains or the alpha and 
beta chain domains that form the "walls" and "floor" of the peptide-binding cleft 

The invention also relates to an antibody, firagments or derivatives thereof, directed 
to and reactive with the above-described MHC class II antigenic peptides. The general 
methodology for producing antibodies is weU known and is disclosed per example in 
Kohler and Milstein, 1975, Nature 256,494 or in J. G. R. Hurrel, Monoclonal Hybridoma 
Antibodies: Techniques and Applications, CRC Press Inc., Boco Raron, FL (1982). The 
cintibodies can be polyclonal or, preferably, monoclonal, or antibody firagments like be F 
(ab*) 2, Fab, Fv or scFv. The antibodies of the present invention may also be humanized 
(Merluzzi S. et al., (2000), Adv. Clin. Path., 4(2): 77-85) or human antibodies (Aujame L. et 
al., Hvun. Antibodies, (1997), 8(4): 155-168). 

The present invention also provides a nucleic acid molecule encoding a MHC class II 
antigenic peptide of the invention comprising (a) at least the amino add sequence of the 
peptide binding motif sdected firom the group consisting of SEQ ID NOs. 49 to 57 and 
SEQ ID NOs. 103 to 122, or (b) at least the amino acid sequence of the peptide binding 
motif selected firom the group consisting of SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 
122, with additional N- and C-terminal flanking sequences of a correspoiiding sequence 
selected from the group consisting of SEQ ID NOs. 1 to 39 and SEQ. ID NOs. 58 to 102. 
Preferably, the nucleic add molecule is a DNA molecule. 

Furthermore, a nucleic acid molecule is provided encoding a MHC class n antigenic 
peptide of the invention linked to a MHC dass II molecule. 

This invention also provides a recombinant nudeic add construct comprising the 
nucleic acid molecules as described above, operably linked to an expression vector. 
Expression vectors suitable for use in the present invention comprise at least one 
expression control element operably linked to the nudeic acid sequence encoding the 
antigenic peptide or the antigenic peptide linked to a MHC dass II molecule. The 
recombinant expression construct may be a DNA constract. 

The expression control elements are inserted in the vector to control and regulate the 
expression of the nucleic add sequence encoding the antigenic peptide of the invention. 
Examples of expression control dements indude> but are not limited to, lac system. 
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operator and promoter regions of phage lambda, yeast promoters and promoters derived 
from polyoma, adenovirus, retrovirus or SV40. Additional preferred or required 
operational elements include, but are not limited to, leader sequence, termination codons, 
polyadenylation signals and any other sequences necessary or preferred for the appropriate 
transcription and subsequent translation of the nucleic acid sequence in the host system. It 
will be understood by one skilled in the art that the correct combination of required or 
preferred expression control elements will depend on the host system chosen. It will further 
be understood that the expression vector should contain additional elements necessary for 
the transfer and subsequent repUcation of the expression vector containing the nudeic add 
sequence in the host system. Examples of such dements indude, but are not limited to, 
origins of replication and sdectable markers. It will further be understood by one skilled in 
the art that such vectors are easily constructed using conventional methods ("DNA 
Isolation and Sequencing", Bruce A. Roe, Judy S. Crabtree and Akbar S. Khan, Published by 
John Wiley & Sons, 1996) or are commerdally available. 

Another aspect of this invention rdates to a host organism or a host cell into which a 
recombinant nudeic acid construct comprising the nucleic acid molecules as described 
above, operably linked to an expression vector, has been inserted. The host cells 
transformed with the nucleic add constructs encompassed by this invention include 
eukaryotes, such as animal, plant, insect and yeast cells and prokaryotes, such as E. coli. 
The means by which the nucleic add construct carrying the nucleic acid sequence may be 
introduced into the cell include, but are not limited to, microinjection, electroporation, 
transduction, or transfection using DEAE-dextran, Upofection, calcium phosphate or other 
procedures known to one sMQed in the art (Sambrook et al, (1989) in "Molecular Cloning. 
A Laboratory Manual", Cold Spring Harbor Press, Plainview, New York). 

In a preferred embodiment, eukaryotic expression vectors that function in eukaryotic 
cells are used. Examples of such vectors include, but are not limited to, retroviral vectors, 
vacdnia virus vectors, adenovirus vectors, herpes virus vector, fowl pox virus vector, 
plasmids, or the baculovirus transfer vectors. Preferred eukaryotic cell lines include, but are 
not Umited to, COS cells, CHO ceUs, HeLa ceUs, NTH/3T3 ceUs, 293 cells (ATCC# 
CRL15731), T2 cells, dendritic cells, monocytes or Epstein-15 Barr Virus transformed B 
cells. 

An antigenic peptide of the invention can be obtained, for example, by extraction 
from a natural source (e.g., elution from MHC II molecules); by expression of a 
recombinant nuddc add encoding the peptide; or by chemical synthesis. A peptide that is 
produced in a cellular system different from the source from which it naturally originates is 
"isolated," because it will be separated from components which naturally accompany it The 
recombinant peptide expressed by a host organism can be obtained as a crude lysate or can 
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be purified by standard protein purification procedures known in the art which may- 
include diflFerential precipitation, size exclusion chromatography, ion-exchange 
chromatography, isoelectric focusing, gel electrophoresis, affinity, and inamimoafiSnity 
chromatography and the like. The extent of isolation or purity can be measured by any 
appropriate method, e.g. mass spectrometry or HPLC analysis. The peptides may be 
prepared synthetically by procedures described in Merrifi.eld, (1986) Science 232: 341-347, 
and Barany andMerrifidd, The Peptides, Gross and Meienhofer, eds (N. Y., Academic 
Press), pp. 1-284 (1979). The synthesis can be carried out in solution or in solid phase or 
with an automatized synthesizer (Stewart and Young, Solid Phase Peptide Synthesis, 2nd 
ed., Rockford 111., Pierce Chemical Co. (1984)). 

Therefore, the present invention fiirther provides a method for producing a MHC 
class II antigenic peptide comprising (a) at least the amino acid sequence of the peptide 
binding motif selected firom the group consisting of SEQ ID NOs. 49 to 57 and SEQ ID 
NOs. 103 to 122, or (b) at least the amino acid sequence of the peptide binding motif 
selected firom the group consisting of SEQ ID NOs. 49 to 57 and SEQ ID NOs, 103 to 122, 
with additional N-and C- terminal flanking sequences of a. corresponding sequence selected 
from the group consisting of SEQ ID NOs. 1 to 39 and SEQ ID NOs. 58 to 102, comprising 
the steps of cultining the host cell containing a recombinant nucleic acid construct as 
described above under conditions allowing expression of said peptide and recovering the 
peptide from the cells or the culture mediimi. 



In a fiirther embodiment of the present invention, a method is provided for isolating 
and identifying MHC class II associated RA antigenic peptides in femtomolar amounts, 
which method comprises (a) providing immature dendritic cells in a number comprising 
0.1 to 5 [xg MHC class 11 molecules; (b) contacting the cells of (a) with serum or synovial 
fluid and inducing maturation of dendritic cells by adding TNFalpha; (c) isolating dass 11 
MHC molecule-antigenic peptide complexes from the cells with methods comprising 
solubilization of the cells and sequestration of the complexes of MHC class II molecules 
with antigenic peptides by immimopredpitation or immunoaffinity chromatograph)^ (d) 
washing the sequestered complexes of MHC class n molecides with antigenic peptides with 
water in an ultrafiltration tube; (e) duting the assodated antigenic peptides from the MHC 
class n molecules at 37''C with diluted trifluoro acetic add, and (f) separating, detecting 
and identifying the isolated peptides by liquid chromatography and mass spectrometry, 
Ftuthermore, in step (f) of the method, the liquid duromatography comprises a first linear 
dution step from the reversed-phase material with a volume suffident to elute the majority 
of contaminants prior to peptide dution. Moreover, the method may further comprise (g) 
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analyzing the identified peptides by methods comprising a database and a software 
developed to perform comparative data analysis across mtdtiple datasets. 

The amount of tissue or bodily fluid necessary to obtain e.g. 100 ng MHC class II 
molecules depends on the number of cells that do express MHC class 11 and on the 
expression rate of MHC class 11 molecules: e.g. 100 ng of MHC class II are equivalent to 
about 2 X 10^ mature DCs or 5 to 10 x 10^ peripheral blood monocytes or about 5 x 10^ 
peripheral blood mononuclear cells which can be obtained from about 50 ml of blood. 

For the purification of class II MHC molecule-antigenic peptide complexes from cells 
or tissue, the membranes of the ceDs or tissue have to be solubilized. Cell lysis may be 
carried out with methods known in the art, e.g- freeze-and-thaw cycles and the use of 
detergents, and combinations thereof. Preferred lysis methods are solubilization using 
detergents, preferably TX- 100, NP40, n-octylglucoside, Zwittergent, Lubrol, CHAPS, most 
preferably TX-lOO or Zwittergent 3-12. Cell debris and nuclei have to be removed from cell 
lysates containing the solubilized receptor-peptide complexes by centrifiigation. Therefore, 
tiie complexes of class II MHC molecules with antigenic peptides are isolated from the cells 
with methods comprising solubilization with a detergent 

Furthermore, the MHC class II molecule-peptide complexes are purified from cell 
lysates by methods comprising immunoprecipitation or immunoaflfinity chromatography. 
For the immunoprecipitation or immunoafSnity chromatography, antibodies specific for 
MHC class II molecules and suitable for these methods are used. The specific antibodies are 
preferably monoclonal antibodies, and are covalently or non-covalently e.g. via Protein A, 
coupled to beads, e.g. sepharose or agarose beads. A selection of the broad panel of anti- 
HLA antibodies used in the prior art comprises: anti-HLA-DR antibodies: L243, TU36, 
DA6.147, preferably L243; anti-HLA-DQ antibodies: SPVL3, TU22, TU169, preferably 
TU22 and TU169; anti-HLA-DP antibody B7/21 and anti-HLA-A3,C antibodies W6/32 
and B9.12. 

Monoclonal antibodies specific for different MHC class II molecules may be 
commercially obtained (e.g. Pharmingen, Dianova) or purified from the supernatant of the 
respective hybridoma cells using Protein A- or Protein G- af&nity chromatography. 
Purified monoclonal antibodies may be coupled by various methods known in the art, 
preferably by covalently coupling antibody amino groups to CNBr-activated sepharose. 

Immunoisolation of MHC molecules may be performed by incubating the antibody- 
beads with the cell lysate under rotation for several hours or diromatographically by 
pumping the cell lysate through a micro-colimm. Washing of the antibody-beads may be 
performed in eppendorf tubes or in the microcolimm. The efficacy of the 
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immunoprecipitation may be analysed by SDS-PAGE and western blotting using 
antibodies recognizing denatured MHC moleailejs (anti-HLA-DRalpha: 1B5; anti-HLA 
classI:HC10orHCA2). 

The sequestered MHC class II molecule-peptide complexes are washed vnth water or 
5 low-salt buffer before elution in order to remove residual detergent contaminants. The low 
salt buffer may be a Tris, phosphate or acetate buffer in a concentration range of 0.5 - 10 
mM, preferably in a concentration of 0.5 xnM. In a more preferred embodiment, the MHC 
class n molecule -peptide complexes are washed witii ultrapure water (sequencing grade) 
conventionally used for HPLC analysis, preferably with ultrapure (sequencing grade) water 
10 from MERCK. The washing step may be carried out by ultrafiltration. The ultrafiltration 
may be carried out in an ultrafiltration tube with a cut-off of 30 kD, 20 kD, 10 kD or 5 kD, 
preferably of 30 kD and a tube volmne of 0.5 - 1.0 ml ("Ultrafree" tubes; Millipore). The 
washing in the ultrafiltration tube may be carried out 4 to 12 times, preferably 6 to 10 
times, with a volume of 10 to 20 times the volume of the beads carrying the receptor- 
15 peptide complexes, preferably with a volume of 15 times the beads. The eluted peptides 
maybe separated fi-om the remaining MHC class II molecules using the same.ultrafiltration 
tube. The eluted peptides may then be lyophilized. 

By eluting tiie peptides from the MHC class II molecules, a complex mixture of 
naturally processed peptides derived from the source of potential antigen and from 
20 polypeptides of intra- or extracellular origin, is obtained. Only after elution, peptides can 
be separated and subjected to sequence analysis. 

The antigenic peptides in the method of the present invention may be eluted by a 
variety of methods known in the art, preferably by using diluted add, e.g., diluted 
acetonitrile (Jardetzky TS et al., Natiure 1991 353, 326-329), diluted acetic acid and heating 
25 (Rudensky AY et ali. Nature 1991, 353, 622-626; Chicz RM et al., Nature 1992, 358, 764- 
768) or diluted trifluoro acetic acid at 37°C (Kropshofer H et al., J Exp Med 1992, 175, 
1799-1803). Most preferably, tiie peptides are eluted at 37°C witii diluted trifluoro acetic 
acid. 

The isolated antigenic peptides are then separated, detected and identified. By 
30 detecting it is understood that the amino acid sequence of the individual peptides in the 
mixture of isolated antigenic peptides is elucidated by methods adequate to detect and 
sequence femtomolar amoimts of peptides. By identifying it is understood tiiat it is 
established from which proteins or polypeptides the antigenic peptides are derived and 
which sequence they constitute within these proteins or polypeptides. 
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In a first step, the complex mixture of eluted peptides may be separated by one of a 
variety of possible chromatographic methods, e.g. by reversed phase, anion exchange, 
cation exchange chromatography or a combination thereof. Preferably, the separation is 
performed by CIS-reverse phase chromatography or by reversed-phase / cation exchange 
two-dimensional HPLC, denoted as MudPit (Washburn MP et al., Nat BiotechnoL, (2001), 
19, 242-247). 

The separation is done in a HPLC mode utilizing fused-silica micro-capillary 
columns which are either connected to a nano-flow electrospray source of a mass 
spectrometer or to a micro-fractionation device which spots the fractions onto a plate for 
MALDI analysis. 

Liquid chromatography comprises peptide fractionation by the use of a strong ion 
exchange material and a hydrophobic reversed-phase material. For the dution of the 
peptides from the ion exchange and reversed-phase material different elution programs are 
run one after another comprising elutions with salt and with organic solvents, e,go 
acetonitrile. The elution from the reversed-phase material is conducted in several steps of 
linear gradients of different lengths and slopes. A contamination in the sample to be 
fractionated may be any contamination whose elution competes witii the detection of the 
peptide peaks in the mass spectrometer. Therefore, in order to prevent simultaneous 
elution, the contanninants have to be eluted with a sufficient solvent volume prior to the 
peptide elution step. Depending on the column used for liqviid chromatography the solvent 
volume suflBdent to elute the contaminants prior to the peptide elution step maybe 100 to 
200 times the colmnn volume, 

A variety of mass spectrometric techniques are suitable, preferably MALDI-post 
source decay (PSD) MS or electrospray ionization tandem mass spectrometry (ESI-MS), 
most preferably ion-trap ESI-MS. 

The sequences of the individual peptides can be determined by means known in the 
art. Preferably, sequence analysis is performed by fragmentation of the peptides and 
computer-assisted interpretation of the fragment spectra using algorithms, e.g. MASCOT 
or SEQUEST, Both computer algorithms use protein and nucleotide sequence databases to 
perform cross-correlation analyses of experimental and theoretically generated tandem 
mass spectra. This allows automated high through-put sequence analysis. 



The isolated and identified antigenic peptides of the invention can be validated by the 
MHC binding motif, the MHC binding capacity and/or by T cell recognition. 
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MHC binding motif 

Peptides associated to a particular MHC molecule (allelic variant) have common 
structural characteristics, denoted as binding motifs, necessary to form stable complexes 
with MHC molecules. Peptide ligands eluted from MHC dass I molecules are relatively 
short, ranging from 8-11 amino acids. Moreover, 2 or 3 side chains of the peptide are 
relevant for binding. The position of the respective amino acid side chains varies with the 
HLA allele, most often two of these so-called "anchor" residues are located at positions 2 
and 9. With respect to a particular anchor position, only 1 or 2 amino acids normally can 
function as anchor amino acids e.g. leucine or valine V at position 2 in the case of HLA-A2. 

In the case of MHC class 11 molecules, the peptide length varies from 11 to 25 amino 
acids, as longer peptides can bind since both ends of the peptide binding groove are open. 
Most HLA dass II molecules accommodate up to 4 anchor residues at relative positions PI, 
P4, P6 and P9 contained in a nonameric core region. This core region, however, can have 
variable distance from the N-terminus of the peptide. In the majority of cases, 2-4 N- 
terminal residues precede the core region. Hence, the PI anchor residues is located at 
positions 3, 4 or 5 in most HLA class EI associated peptides. Peptides eluted from HLA-DR 
dass n molecules share a big hydrophobic PI anchor, represented by tyrosine, 
phenylalanine, tryptophane, methionine, leucine, isoleucine or valine. 

The position and the exact type of anchor residues constitute the peptide binding 
motif which is known for most of the frequentiy occurring HLA class II allelic products. A 
computer algorithm allowing motif vaUdation in peptide sequences is "Tepitope", available 
by vaccinome. 

MHC binding capacity 

Peptides identified by the method of the invention may be tested for their ability to 
bind to the appropriate MHC class II molecule by methods known in the art using, for 
example, isolated MHC class II molecules and synthetic peptides with amino acid 
sequences identical to those identified by the method of the invention (Kropshofer H et al., 
J. Exp. Med. 1992; 175, 1799-1803; Vogt AB et al., J. Immunol 1994; 153, 1665-1673; Sloan 
VS et al.. Nature 1995; 375, 802-806). Alternatively, a cellular binding assay using MHC 
class II expressing cell lines and biotinylated peptides can be used to verify the identified 
epitope (Amdt SO et al., EMBO J., 2000; 19, 1241-1251) 

In both assays, the relative binding capacity of a peptide is measured by determining 
the concentration necessary to reduce binding of a labelled reporter peptide by 50%. This 
value is called IC50. Peptide binding with a reasonable afifinity to the relevant HLA dass n 
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molecules attain IC50 values not exceeding lO-fold the IC50 of established reference 
peptides. 

The same binding assays can also be used to test the abiUty of peptides to bind to 
alternative class II MHC molecules, i.e., dass II MHC molecules other than those from 
which they were eluted using the method of the invention. The diagnostic methods of the 
invention using such peptides and therapeutic methods of the invention, using either the 
peptides or peptides derived from them, can be applied to subjects expressing such 
alternative class II MHC molecules. 

T cell recognition 

The epitope verification procedure may involve testing of peptides identified by the 
method of the invention for their ability to activate CD4+ T cell populations. Peptides with 
amino acid sequences either identical to those identified in the present invention or 
corresponding to a core sequence derived from a nested group of peptides identified in the 
present invention are synthesized. The synthetic peptides are tiien tested for their ability to 
activate CD4+ T cells from (a) test subjects expressing the MHC class II molecule of 
interest and having at least one symptom of the disease; and (b) control subjects expressing 
the MHC class II molecule of interest and having no symptoms of the disease. Additional 
control subjects can be those with symptoms of the disease and not expressing the MHC 
dass II molecule of interest. 

In some diseases (e.g., those with an autoimmune component) responsiveness in the 
CD4+ T cells of test subjects but not in CD4+ T cells of the control subjects described in 
(b) provides confirmatory evidence that the relevant peptide is an epitope that activates 
CD4+ T cells that can initiate, promote, or exacerbate the relevant disease. In other diseases 
(e.g., cancer or infectious diseases without an autoimmune component), a similar pattern 
of responsiveness and non-responsiveness to tiiat described in the previous sentence would 
indicate that the relevant peptide is an epitope that activates CD4+ T cells that can mediate 
immunity to the disease or, at least, a decrease in the symptoms of the disease. 

CD4H- T cell responses can be measured by a variety of in vitro methods known in the 
art. For example, whole peripheral blood mononuclear cells (PBMC) can be cultured with 
and without a candidate synthetic peptide and their proliferative responses measured by, 
e.g., incorporation of [^H] -thymidine into their DNA. That the proliferating T cells are 
CD4+ T cells can be tested by either eliminating CD4+ T cells from the PBMC prior to 
assay or by adding inhibitory antibodies that bind to the CD4+ molecule on the T cells, 
thereby inhibiting proliferation of the latter. In both cases, the proliferative response will be 
inhibited only if CD4+ T cells are the proliferating cdls. Altematively, CD4+ T ccOls can be 
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purified firom PBMC and tested for proliferative responses to the peptides in the presence 
of APC expressing the appropriate MHC class II molecule. Such APC can be B- 
lymphocytes, monocytes, macrophages, or dendritic cells, or whole PBMC. APC can also 
be immortalized cell lines derived firom B-lymphocytes, monocytes, macrophages, or 
dendritic cells. The APC can endogenously express the MHC class II molecule of interest or 
they can express transfected polynucleotides encoding such molecules. In all cases the APC 
can, prior to the assay, be rendered non-proUferative by treatment with, e.g., ionizing 
radiation or mitomycin-C. 

As an alternative to measuring cell proliferation, cytokine production by the CD4+ T 
cells can be measured by procedures known to those in art. Cytokines include, without 
limitation, interleukin-2 (IL-2), interferon-gamma (IFN-gamma), interleukin-4 (IL-4), 
TNF-alpha, interleukin-6 (IL-6), interleukin-lO (IL-10), interleukin-12 (IL-12) or TGF- 
beta. Assays to measure them include, without limitation, ELISA, and bio-assays in which 
cells responsive to the relevant cytokine are tested for responsiveness (e.g., prohferation) in 
the presence of a test sample. 

Alternatively, cytokine production by CD4+ lymphocytes can be directly visualized 
by intracellular immunofluorescence staining and flow cytometry. 

Moreover, the MHC class II antigenic peptides of the present invention may be used 
in the diagnosis of RA. Therefore, a further embodiment of the invention is the use of an 
antigenic peptide according to the present invention as a marker for RA. 

Preferably, a MHC class II antigenic peptide comprising (a) at least the amino add 
sequence of the peptide binding motif selected firom the group consisting of SEQ ID NOs. 
49 to 57 and SEQ ID NOs. 103 to 122, or (b) at least the amino acid sequence of the 
peptide binding motif selected firom the group consisting of SEQ ID NOs. 49 to 57 and 
SEQ ID NOs. 103 to 122, with additional N-and C-terminal flanking sequences of a 
corresponding sequence selected firom the group consisting of SEQ ID NOs. 1 to 39 and 
SEQ ID NOs. 58 to 102, is used as a marker for RA. 

In another embodiment, the antigenic peptides of the invention may be used as 
response markers to track the efficacy of a therapeutic regime. Essentially, a baseline value 
for an antigenic peptide can be determined, then a given therapeutic agent is administered, 
and the levels of the antigenic peptide are monitored subsequently, whereas a change in the 
level of the antigenic peptide is indicative of the efficacy of a therapeutic treatment 
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Furthermore, the antigenic peptides which are only found in certain stages or phases 
of a disease, preferably of RA, may be utilized as stage-specific markers. Essentially, the 
levels of the antigenic peptides which have been linked to a certain disease stage are 
monitored regularly, thereby providing information about the stage of the disease and its 
progression. 

The invention also includes the use of the polypeptides the RA antigenic peptides are 
derived from as markers for the diagnosis and monitoring of a disease, preferably of RA, 
and in particular, of erosive versus non-erosive RA. The rationale for the use of the 
respective proteins is that DCs reside in most tissues where they capture exogenous 
antigens via specific receptors and via specialized endocytotic mechanisms (e.g. 
macropinocytosis) followed by presentation of the processed antigens as peptides on MHC 
class II molecules. Previous studies have shown that the frequency of a peptide epitope 
found in the context of MHC class 11 molecviles, e.g. the RA antigenic peptides, in the 
majority of cases mirrors the abundance of the protein from which this particular peptide 
was derived from. Therefore, not only the RA antigenc peptides but also the corresponding 
proteins can serve as markers for RA. 

Therefore, in a fiirther embodiment of the present invention, the use of a polypeptide 
selected from the group consisting of interferon-gamma-inducible lysosomal thiol 
reductase (SEQ ID NO: 40), integrin beta-2 (SEQ ID NO. 123), phosphatitylinositol-4,5- 
bisphosphate 3-kinase (SEQ ID NO: 124), urokinase-type plasminogen activator (SEQ ID 
NO. 125), immunoglobulin heavy chain V-HI region (Vh26) (SEQ ID NO. 126), DJ-1 
protein (SEQ ID NO. 127), apolipoprotein B-lOO (SEQ ID NO: 41), 26S proteasome non- 
ATPase regulatory subunit 8 (SEQ ID NO. 128), interleukin-1 receptor (SEQ ID NO: 129), 
fibromodulin (SEQ ID NO. 130), GM-CSF/IL-3/IL-5 receptor (SEQ ID NO. 131), sorting 
nexin 3 (SEQ ID NO. 132), mter-alpha-trypsin inhibitor heavy chain H4 (SEQ ID NO: 42), 
complement C4 (SEQ ID NO: 43), complement C3 (SEQ ID NO: 44), SH3 domain- 
binding glutamic acid-rich-like protein 3 (SEQ ID NO: 45), interleukin-4-induced protein 
1 (SEQ ID NO: 46), hemopexin (SEQ ID NO: 47), Hsc70-interacting protein (SEQ ID NO: 
48), invariant chain (li) (SEQ ID NO. 133), retinoic acid receptor responder protein 2 
(SEQ ID NO. 134), fibronectin (SEQ ID NO. 135), cathepsin B (SEQ ID NO: 136), 
tripeptidyl-peptidase 11 (SEQ ID NO. 137), legumain (SEQ ID NO. 138), platelet activating 
factor receptor (SEQ ID NO. 139), poly-alpha-2.8-sial)dtransferase (SEQ ID NO. 140), ras- 
leated protein Rab-llB (SEQ ID NO. 141) as a marker for RA is provided. Preferably, the 
polypeptide is used as a marker for erosive RA. It is also preferred to use the polypetide as a 
marker for non-erosive RA. Especially preferred is the use of interleukin-4-induced protein 
1 (SEQ ID NO: 46) as a marker for RA. The Figl polypeptide has not been known as a 
marker for RA until now, and is considered as an important candidate marker for RA. 
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The diagnosis of RA can be made by examining expression and/or composition of a 
polypeptide or peptide marker for RA, by a variety of methods, including enzyme linked 
immunosorbent assays (ELISAs), Western blots, immunoprecipitations and 
immunofluorescence. A test sample from an individual is assessed for the presence of an 
alteration in the expression and/or an alteration in composition of a polypeptide or a 
peptide of the present invention. An alteration in expression of a polypeptide or peptide 
can be, for example, an alteration in the quantitative polypeptide expression (i.e., the 
amount of polypeptide produced); an alteration in the composition of a polypeptide is an 
alteration in the qualitative polypeptide expression (c.^., expression of a mutant 
polypeptide or of a different spUcing variant). 

Both such alterations (quantitative and qualitative) can also be present An 
"alteration" in the polypeptide expression or composition, as used herein, refers to an 
alteration in expression or composition in a test sample, as compared with the expression 
or composition of the peptide or polypeptide in a control sample. A control sample is a 
sample that corresponds to the test sample (e.g., is from the same type of ceUs), and is from 
an individual who is not affected by RA. An alteration in the expression or composition of 
the peptide or polypeptide in the test sample, as compared with the control sample, is 
indicative of RA or a susceptibiUty to RA. Various means of examining expression or 
composition of a peptide or polypeptide of the present invention can be used, including 
spectroscopy, colorimetry, electrophoresis, isoelectric focusing, and immunoassays {e.g.y 
David et aly U.S. Pat. No. 4,376,110) such as immunoblotting (see also Current Protocols 
in Molecular Biology, particularly chapter 10). For example, in one embodiment, an 
antibody capable of binding to the polypeptide (e.g:, as described above), preferably an 
antibody with a detectable label, can be used. Antibodies can be polyclonal, or more 
preferably, monoclonal. An intact antibody, or a fragment thereof (e.g:. Fab or F(ab*)2) can 
be used. The term "labeled", with regard to the probe or antibody, is intended to 
encompass direct labeling of the probe or antibody by coupling (i.e., physically linking) a 
detectable substance to the probe or antibody, as well as indirect labeling of the probe or 
antibody by reactivity with another reagent that is directly labeled. Examples of indirect 
labeling include detection of a primary antibody using a fluorescently labeled secondary 
antibody and end-labeling of a DNA probe with biotin such that it can be detected with 
fluorescently labeled streptavidin. 

Western blotting analysis, using an antibody as described above that specifically binds 
to a peptide or polypeptide of the present invention, may be used to measure the level or 
amount of a peptide or polypeptide in a test sample and comparing it with the level or 
amount of the peptide or polypeptide in a control sample. Preferably the peptide or 
polypeptide in a test sample is measured in a homogenous or a heterogenous immuno 
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assay. A level or amount of the polypeptide in the test sample that is higher or lower than 
the level or amount of the polypeptide in the control sample, such that the difference is 
statistically significant, is indicative of an alteration in the expression of the polypeptide, 
and is diagnostic for a RA or a susceptibility to RA, 

Therefore, the present invention also relates to a diagnostic composition comprising 
an antibody reactive with a MHC class n antigenic peptide of the invention. 

In a further embodiment the antigenic peptides of the invention or the proteins they 
are derived firom may be used in the prevention and treatment of a disease, preferably of 
RA. 

One aspect of the invention is a therapeutic purpose, wherein one or more of the 
identified antigenic peptides are used to vaccinate patients against RA, preferably against 
erosive and/or non-erosive RA. In the course of the vaccination the antigenic peptide 
would induce an antigen-specific T cell tolerance in the patient which would ultimately 
lead to regression of the disease or to an attenuation of disease development. 

A prondsing strategy to induce specific immune tolerance in future clinical trials is 
the use of DNA tolerizing vaccines. DNA tolerizing vaccines encoding autoantigens alone 
were shown to reduce T cell proliferative responses (Ruiz, P. et aL, J Inmiunol 162 (1999) 
3336-3341), while DNA tolerizing vaccines co-delivering autoantigen plus IL-4 also 
induced protective Th2 responses (Garren, H. et al.. Immunity 15 (2001) 15-22). Examples 
of non-polynudeotide-specific tolerizing therapies xmder development include protein 
antigens, naturally processed peptides, altered peptide ligands, other biomolecules, such as 
DNA, or proteins and peptides containing posttranslational modifications, and antigens 
delivered orally to induce "oral tolerance" (reviewed in: Robinson, W.H. et al., Clin 
Immunol 103 (2002) 7-12). A potential adverse effect with regard to tolerizing therapies is 
the development of autoinmiunity. 

To this end, the relevant RA antigenic peptides may be directly administered to the 
patient in an amount sufficient for the peptides to bind to the MHC molecules, and 
provoke peripheral tolerance of T cells. 

Alternatively, the antigenic peptides of the invention may be utilized for the 
generation of vaccines based on DCs. In this case, autologous DCs derived firom patients' 
monocytes may be pulsed with the relevant peptides or recombinant proteins containing 
the relevant peptide sequences. 
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Therefore, the present invention provides a pharmaceutical composition comprising 
a MHC class II antigenic peptide comprising (a) at least the amino acid sequence of the 
peptide binding motif selected from the group consisting of SEQ ID NOs. 49 to 57 and 
SEQ ID NOs. 103 to 122, or (b) at least the amino acid sequence of the peptide binding 
motif selected from the group consisting of SEQ ID NOs. 49 to 57 and SEQ ID NOs. 103 to 
122, with additional N-and C-terminal flanking sequences of a corresponding sequence 
selected from the group consisting of SEQ ID NOs. 1 to 39 and SEQ ID NOs. 58 to 102, an 
antibody reactive with said antigenic peptide, or a polypeptide selected from the group 
consisting of SEQ ID NOs 40 to 48 and SEQ. ID NOs. 123 to 141, and optionally a 
pharmaceutically acceptable excipient, diluent or carrier. The antigenic peptide has to be 
present in an amoimt sufficient to tolerize specific lymphocytes. Such an amoimt will 
depend on the peptide used, the administration, the severity of the disease to be treated and 
the general conditions of the patient and wiU usually range from 1 to 50 mg/nni, for 
example in case of peptides being loaded on dendritic cells. 

An acceptable excipient, diluent or carrier may be phosphate buffered saline for in 
vitro studies and physiological salt solutions for in vivo appUcations. 

'Vacdnation" herein means both active inmiunization, i. e. the in vivo administration 
of the peptides to elidt an in vivo immune tolerance directly in the patient and passive 
immxmization, i. e. the use of the peptides to tolerize in vitro CD4+ T lymphocytes or to 
stimulate autologous or allogeneic dendritic cells, which are subsequently re-inoculated 
into the patient. 

The present invention also provides the antigenic peptides, antibodies, nucleic acids, 
host cells, methods, compositions and uses substantially as herein before described 
especially with reference to the Examples. 

Having now generally described this invention, the same will become better 
understood by reference to the specific examples, which are induded herein for purpose of 
illustration only and are not intended to be limiting unless otherwise specified, in 
connection with the following figures. 
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Examples 

The examples below are illustrated in comiection with the figures described above 
and based on the methodology summarized in Fig 1, as described in the following. 
Commercially available reagents referred to in the examples, were used according to 
5 manufacturer's instructions unless otherwise indicated. 



Methodology of the invention 
Dendritic cells and culturing 

The study was performed with human dendritic cells which were differentiated from 
10 monocytes, as described below. Monocytes were purified from human peripheral blood. 
The blood was taken from healthy donors with the following haplotypes: (l)HLA- 
DEB1''040U ""03011, (2) HLA-DEBl''0401y m04, (3) HLA-DRBl''040h ^301, (4) HLA- 
DBBl ''0401, ""OJOh HLA'DRBl ''0401, ""0407. 

All cells were cultured in RPMI 1640 mediimi (short: RPMI) supplemented with 1 
15 mM Pyruvate, 2 mM Glutamine and 10% heat-inactivated fetal calf serum (Gibco BRL, 
RockviQe,MD). 

Isolation of peripheral blood mononuclear cells (PBMCs) 

Peripheral blood was obtained from the blood bank in Mannheim, Germany as 
standard buffy coat preparations from healthy donors. Heparin (200 I.U./ml blood, 

20 Liquemine, Roche) was used to prevent clotting. Peripheral blood mononuclear cells 
(PBMCs) were isolated by centrifiigation in LSM® (1.077-1.080 g/ml; ICN, Aurora, OH) at 
800g (room temperature) for 30 min. PBMCs were collected from the interphase and 
washed twice in RPMI containing 20 mM Hepes (500g for 15 min, 300g for 5 nun). In 
order to remove erythrocytes, PBMCs were treated with ALT buffer (140 mM ammonium 

25 chloride, 20 mM Tris, pH 7.2) for 3 min at 57^C, PBMCs were washed twice with RPMI 
containing 20 mM Hepes (200g for 5 min). 

Generation of dendritic cells from peripheral blood monocytes. 

Monocytes were isolated from PBMCs by positive sorting using anti-CD 14 magnetic 
beads (Miltenyi Biotech, Auburn, CA) according to the manufacturer's protocol. 
30 Monocytes were cultured in RPMI supplemented with 1% non-essential amino adds 
(Gibco, BRL, RockviUe, MD), 50 ng/ml recombinant human granulocyte macrophage- 
colony stimulating factor (GM-CSF; S.A. 1.1x10^ U/mg) (Leucomax; Novartis, Basel 
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Switzerland) and 3 ng/ml recombinant human IL-4 (S.A. 2.9x10^ U/pig) (R8dD Systems, 
MinneapoKs, MN). Monocytes were seeded at 0.3 x lOVml in 6- well plates (Costar) for 5 
days to obtain immature dendritic cells. 

The quality of monocyte-derived immature dendritic cells was routinely monitored 
5 by flow-cytometric analysis and assessed to be appropriate when they displayed the 
following phenotype: CDla (hi^), CD3 (neg.), CD14 (low), CD19 (neg.), CD56 (neg.), 
CD80 (low), CD83 (neg.), CD86 (low) and HIA-DR (high). In contrast, mature dendritic 
cells (c£ below) display the following phenotype: CDla (low), CD80 (high), CD83 (high), 
CD86 (high) and HLA-DR (high). Monoclonal antibodies against CDla, CD3, CD14, 
- 10 CD19, CD56, CD80, CD83, CD86 as well as the respective isotype controls were purchased 
from Pharmingen (San Diego, CA). 

Exposure of dendritic cells to serum or synovial fluid 

Serum and synovial fluid were irradiated for 30 min with ^^^Cs (70 TBq). To feed 
dendritic cells with serum- or synovia-derived antigen, 6 x 10^ inmiature dendritic cells 
15 were pulsed with either 1 ml serum or 0.6 ixA synovial fluid. At the same time maturation of 
dendritic cells was induced by adding 10 ng/ml recombinant human tumor necrosis factor 
alpha (TNFc« S.A. 1.1x10^ U/^g). As a control, 6 x 10^ immature dendritic cells were 
incubated with TNFa alone. 

After 24 hrs in culture, mature dendritic cells were harvested by centrifugation at 
20 300g for 10 min. Cells were washed with PBS and transferred to an eppendorf tube. After 
centrifugation at 400g for 3 min, the supernatant was completely removed and the cells 
were frozen at -70®C. 

Generation of anti-HLA class n beads 

The anti-HLA-DR monoclonal antibody (mAb) L243 (ATCC, Manassas, VA) was 
25 produced by culturing the respective mouse hybridoma cell line. mAb L243 was purified 
using ProteinA sepharose (Pharmacia, Uppsala, Sweden) and immobilized to CNBr- 
activated sepharose beads (Pharmacia) at a final concentration of 2.5 mg/ml, according to 
the manu&cturer's protocol. L243 beads were stored in PBS containiag 0.1% Zwittergent 
3-12 (Calbiochem, La Jolla, CA). 
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Nano-scale purification of HLA-DR-peptide complexes 

Pellets of firozen dendritic cells were resuspended in 10-fold volume of ice cold lysis 
buffer (1% Triton-X-100, 20 mM Tris, pH 7,8, 5 mM Mga2, containing protease 
inhibitors chymostatin, pepstatin, PMSF and leupeptin (Roche, Mannheim, Germany)) 
and lysed in a horizontal shaker at 1000 rpm, 4°C for 1 h. The cell lysate was cleared from 
cell debris and nuclei by centrifiigation at lOOOOg, 4*^0 for 10 min. The lysate was co- 
incubated with L243 beads (5-10 \d L243 beads per 100 nl cell lysate) in a horizontal shaker 
at 1000 rpm, 4°C for 2 hrs. Inmiunoprecipitated HLA-DR-peptide complexes bound to 
L243 beads were sedimented by centrifiigation at lOOOg, 4'*C for 1 min and washed four 
times with 500 |il 0.1% Zwittergent 3-12 (Calbiochem) in PBS. 

The efficacy of depletion of HLA-DR-peptide complexes was monitored by analyzing 
the respective cell lysates before and after immunoprecipitation and aliquots of the beads 
by western blotting using the anti-HLA-DRa-specific mAb IBS (Adams, T.E. et al.. 
Immunology 50 (1983) 613-624). 

Elution of HLA-DR-associated peptides 

HLA-DR-peptide complexes boimd to L243 beads were resuspended in 100 |j1 H2O 
(HPLC-grade; Merck, Darmstadt, Germany), transferred to an ultrafiltration tube, 
Ultrafree MC, 30 kD cut-off (Millipore, Bedford, MA) and washed 10 times with 100 |il 
H2O (HPLC-grade) by centrifiigation for 1-2 min at lOOOOg at RT. For duting the bound 
peptides, 60 |il 0.1% trifluoracetic add (Fluka, Buchs, Switzerland) in H2O (HPLC-grade) 
was added and incubation was performed for 30 min at 37'*C. Fluted peptides were 
collected in a new eppendorf tube by centrigugation of the Ultrafree unit at lOOOOg for 3 
min at RT and inmiediately lyophilized in a Speed-Vac™ vacuum centrifuge. 

Fractionation by two-dimensional nanoflow LC 

To perform high-throughput sequencing of complex peptide nndxtures, the MudPIT 
(multidimensional protein identification technology) was used (Washburn, M.P. et al., Nat 
Biotechnol 19 (2001), 242-247) which is based on Uquid chromatographic fractionation 
followed by mass spectrometric sequence determination. 

To this end, lyophilized peptides eluted from HLA molecules were resuspended in a 
buffer containing 5% (v/v) acetonitrUe (ACN), 0.5% (v/v) acetic acid, 0.012% (v/v) 
heptafluoro butyric acid (HFBA) and 1% (v/v) formic acid. The peptide mixture was 
fractionated on a fiised-silica microcapOlary column (100 |im i.d. x 375 (im) generated by a 
Model P-2000 laser puller (Sutter Instrument Co., Novato, CA). The microcolumn was 
packed with 3 fmi / CIS reversed-phase material (C18-ACE 3 \lm [ProntoSIL 120-3-C18 
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ACE-EPS, Leonberg, Germany]) followed by 3 cm of 5 \xm cation exchange material 
(Partisphere SCX; Whatman, Clifton, USA). 

A fially automated 8-step gradient separation on a LC Packings UltiMate HPLC (LC 
Packings, San Francisco, USA) was carried out, using the following buffers: 5% ACN / 

5 0.012% HFBA / 0.5% acetic acid (buffer A), 80% ACN / 0.012% HFBA / 0.5% acetic acid 
(buffer B), 250 mM ammonium acetate / 5% ACN / 0.012% HFBA / 0.5% acetic acid 
(buffer C), and 1.5 M anmionium acetate / 5% ACN / 0.012% HFBA / 0.5% acetic add 
(buffer D). The first 116 min step consisted of a 75 min gradient firom 0 to 40% buffer B 
followed by a 10 min gradient from 40 to 80% buffer B, a 6 min hold at 80% buffer B and a 

10 10 min equilibration step with 100% buffer A. The next 5 steps (146 min each) were 
characterized by the following profile: 5 min 100% buffer A, 5 min gradient from 0 to x% 
buffer C, 5 min 100% buffer A, 30 min gradient from 0 to 10% buffer B, 55 min gradient 
from 10 to 35% buffer B, 20 min gradient from 35 to 50% buffer B, 10 min gradient from 
50 to 80% buffer B, a 6 min hold at 80% buffer B, and a 10 min equilibration step with 

15 100% buffer A. The buffer C percentages (x) in steps 2-6 were as follows: 20, 40, 60, 80, and 
90%. The 30 min gradient from 0 to 10% buffer B, which is the first linear elution step 
from the reversed-phase material, was needed in order to sufficientiy separate peptide 
elution from the elution of a major contaminant (m/z=945) which otherwise would have 
led to the loss of the more hydrophilic peptide peaks. Step 7 consisted of the following 

20 profile: 5 min 100% buffer A, 20 min 100% buffer C, 5 min gradient from 0 to 10% buffer 
B, 35 min gradient from 10 to 35% buffer B, 50 min gradient from 35 to 50% buffer B, 
10 min gradient from 50 to 80% buffer B, a 5 min hold at 80% buffer B and a 10 min 
equilibration step with 100% buffer A. Step 8 was identical to step 7 with the exception of 
using buffer D instead of buffer C. 

25 Ion trap MS/MS mass spectrometry 

The HPLC column was directiy coupled to a Finnigan LCQ Deca XP Plus ion trap 
mass spectrometer (Thermo Finnigan, San Jose, USA) equipped witii a nano-LC 
dectrospray ionization source. Mass spectrometry in the MS/MS mode was performed 
according to the manufacturer's protocol. Peptides were identified by the SEQUEST 
30 algorithm (U.S. patents 6,017,693 and 5,538,897). 
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MALDI-TOF mass spectrometry 

Peptides spotted onto an AnchorChip plate were co-cristallized with matrix 
(5mg/ml; a-c7ano-4-hydrox5r-dnnamic acid (Merck, Darmstadt, Germany), 50% 
acetonitrile, 0.1% trifluoroacetic add). For qualitative analysis of the whole peptide 
repertoire, samples were analyzed on an Ultraflex™ MALDI-TOF mass spectrometer 
(Bruker, Bremen, Germany), according to the manufecturer's protocol. 

Sequence identification by SEQXJEST and differential dataset analysis. 

MS/MS fragmentation data were analyzed with the software SEQUEST (Thermo 
Finnigan, San Jose, USA). From an in-house protein database, which was created based on 
the pubhc databases Swiss-Prot and TrEMBL, SEQUEST extracted for each spectrum aU 
peptide sequences that corresponded to the molecular mass of the parent ion and measured 
the degree of similarity between the experimental spectrum and the theoretical, in silico 
generated, spectrum. Only the top-scoring candidate sequence was listed. 

The peptide sequences derived from the SEQUEST analysis and their accompanying 
information on mass accuracy, scoring parameters and peptide origin were stored in an 
appropriately designed relational database and further processed. Certain constraints were 
enforced in order to guarantee the stor^e of only significant sequences with satisfying 
SEQUEST scores. The two most important constraints were: (i) keep only those sequences 
that have a cross correlation coefficient (CO) hi^er than a certain value and (ii) from the 
remaining sequences keep those ones which have a predefined delta cross correlation 
coefficient (ACC). For both criteria the minimmn chosen values are based on empirical 
knowledge of interpreting SEQUEST resxilts. 

A dataset was defined as the sum of data from a particular set of spectra. The design 
of database and software allowed queries on a single dataset as well as comparisons of 
multiple datasets. Such a database and software design enables comparative sample 
analysis, which is not provided by SEQUEST. For instance, possible queries on a single 
dataset could provide information on the score distribution among the stored spectra, on 
the existence of further sequence length variants or common subsequences, or on the 
protein origin of peptide sequences. Since the occurrence of truncation variants of the same 
epitope is a general characteristic of class II MHC-boimd peptides, the existence of length 
variants in a dataset provides additional strong evidence for the presence of an epitope in a 
set of spectra. 

The most important feature in the analysis of multiple datasets is the possibiKty to 
extract a common subset of sequences that satisfies a given criterion. Such a criterion could 
be based on sequence similarity, e.g., within all sequences of a collection of datasets, those 
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sequences were selected that had at least one subsequence in common with any other 
sequence. Such comparisons across different datasets constitute the dififerential approach 
(RA samples versus control samples) and thereby optimize the search for candidate RA 
marker peptides. 

The pairwise similarity scores between sequences were calculated by a software 
routine, which is an implementation of a standard string-comparison algorithm. 
Subsequently, these scores were used to group closely-related sequences (sequences sharing 
a common subsequence) in well-separated clusters by an additionally developed software 
routine, which is based on a weU-established algorithm (hierarchical clustering, UPGMA). 

The generated clusters (e.g. of peptide truncation variants) were then used, to identify 
closely-related sequences across different datasets. 

Overall, the data evaluation software provided the ability to perform swiftly and 
reprodudbly the following: 

-Select from the sequence output generated by SEQUEST those sequences that satisfy 
reliable empirical criteria. 

- Store the data in a database appropriately designed for the discovery process at 

hand. 

-Extract information about the sequence content of each stored dataset. This 
information is valuable in assessing the importance of individual sequences Mdthin the 
given dataset and, consequently, across multiple datasets. 

-Provide, by virtue of the multiple dataset comparisons, a tool that realizes the 
differential approadi, namdy the study of the actual sequence content of one sample versus 
other(s). 



Purification of HLA-DR molecules 

HLA-DR molecules were purified from 10^° EBV-transformed B cell lines or T2- 
transfectants by affinity chromatography, using anti-DR monoclonal antibody L243, as 
previously described (Kropshofer H. et al., PNAS 92 (1995) 8313-8317). 



In vitro peptide binding assay 
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HA(307-319), PKYVKQNTLKLAT, is an immiinodoininant epitope from influenza 
virus hemagglutinin that binds well to HLA-DR4 molecxdes and was used as a reporter 
peptide in an in vitro peptide binding assay (Rothbard, J.B. et al.. Cell (1988) 52:515-523). 

Purified detergent-solubilized HIA-DR4 molecules (200 nM) were co-incubated 
with biotinylated HA(307-319) peptide (200 nM) and graded amounts of competitor 
peptide (100 nM - 10 //M) for 24 hrs at ^T^C in binding bufifer (50 nM sodium phosphate, 
50 mM sodium citrate, pH 4.8, 0.1% Zwittergent 3-12) in a total volume of 50 //L The 
competitor peptides were derived from the candidate RA antigens identified in this study 
and purchased as synthetic peptides from Medprobe (Lund, Sweden). 

3 X 10 /;1 were then diluted 10-fold in PBS containing 0.05% Tween-20 and 1% BSA 
and incubated for 2 hrs in a microtiterplate (Nalge Nunc), which has been coated over 
night with anti-DR monoclonal antibody L243. Afterwards the samples were developed by 
incubation with 0.1 //g/ml EU-labeled streptavidin (Wallay Oy, Turku, Finland) for 45 min 
according to the manufacturer's protocol. After intensive washing with 0.05% Tween-20 in 
PBS, Europium fluorescence was measured with a time-resolved fluorometer (VICTOR 
1420, Wallac/Perkin Ehner Life Sciences) to quantify the binding of biotinylated HA(307- 
319) peptide to HLA-DR4 molecules (Amdt, S.O. et al., EMBO J. 19 (2000) 1241-1251). 
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Example 1 

In this example, the technique mentioned in Figure 1 was used to identify novel 
HLA-DR-assodated peptide markers derived from serum and synovial flmd of patients 
with non-erosive RA. 

5 6x 10^ immature dendritic cells were pulsed with either 1 nal senmi (5 samples) or 

0.6 ml synovial fluid (2 samples) of patients with non-erosive RA and cultured for 24 hrs in 
the presence of 10 ng/ml TNFa. As a control, 6 x 10^ dendritic cells were cultured in the 
presence of TNFa (10 ng/ml) without adding serum but 1 ml of PBS. In an additional 
experiment 6 x 10^ dendritic cells were pulsed with 1 ml serum from 2 healthy test persons 

10 and cultured for 24 hrs in the presence of TNFa (10 ng/ml). 

Dendritic cells were lysed in detergent TX-lOO and HLA-DR molecules were isolated 
using mAb L243. HLA-DR-associated peptides were eluted with 0.1% TFA and analyzed by 
high-throughput 2D-LC-MS/MS technology. Peptide identification was achieved by using 
the SEQUEST algorithm. The peptide sequences derived from the SEQUEST analysis and 
15 accompanying information on mass accuracy, scoring parameters and peptide origin were 
stored in a database and further processed. 

The peptide sequences identified from unpulsed DCs (control 1) and from DCs 
pulsed with the serum of healthy test persons (control 2) were compared with the peptide 
sequences identified from DCs pulsed with the serum of non-erosive RA patients. Among 
20 the RA-specific sequences, only those peptides were selected for fiirther evaluation that re- 
occurred in at least three of five non-erosive RA samples. 

In each serum sample roughly 600±150 individual peptide sequences (cross 
correlation coefficient CC > 3.0 and ACC > 0.15) were identified. In the synovia samples 
the number of individual peptide sequences was slightly smaller (400±30). Approximately 

25 80-85% of the peptides found in RA samples were also identified in control samples, 
underlining the high reproducibility of the analysis. In the majority of cases, several length 
variants of the same epitope could be identified which is a typical characteristic of class II 
MHC-bound antigens and supports the validity of the results (Jones, E.Y., Curr Opin 
Immunol 9 (1997) 75-79). Further confidence in the quality of the data reHes on the fact 

30 that several of the identified peptides or proteins have already been described in the context 
of MHC class II molecules: epitopes derived from ubiquitous proteins like Hsp70, enolase, 
annexin II, cathepsin C or collagen II, as weU as from MHC molecules (HLA-A, -B, -C, -E, 
-G, and P2-microglobulin) and CLIP (Chicz, R.M. et al., J Exp Med 178 (1993) 27-47; 
SinigagUa, F. & Hanuner, J., Curr Opin Immunol 6 (1994) 52-56; Amold-Schild, D. et al., J 



wo 2005/014622 



41 



PCT/EP2004/008609 



Immunol 162 (1999) 3757-3760; Vogt, A.B. & Kropshofer, H., Trends Biochem Sd 4 
(1999) 150-154) were frequently detected 

RA-spedfic peptide sequences were further validated with regard to binding to the 
RA susceptibflity allele Diy5J'^401 by using the TEPITOPE software (Hammer, J. et al., 
Adv Inraiunol 66 (1997) 67-100). This software provides means for the qualitative and 
quantitative prediction of T cell epitopes. 

The output of the study consists of an epitope that occurred, apart from one 
exception, only in non-erosive RA samples (Table 1). 

Interferon-gamma-indudble lysosomal thiol reductase 

A very interesting epitope which was identified in 3 out of 7 non-erosive RA samples 
from serum and synovia is derived from the interferon-ganmia-indudble lysosomal thiol 
reductase (GILT): the 16-mer GILT (192-207) with the amino add sequence of SEQ ID 
NO: 3 (Table 1). Further length variants in three other samples support the relevance of 
this epitope (Table 1): the 14-mer GILT (192-205; SEQ ID NO: 1) and the 17-mer GILT 
(192-208; SEQ ID NO: 2). 

As judged from the shortest length variant, GILT (192-205), the epitope contains a 
suitable binding motif, with regard to binding to the RA susceptibiHty allele DRBl'^0401: 
196M serves as a PI anchor, 199M as a P4 anchor and 201 A as a P6 anchor. According to 
TEPITOPE scoring, the epitope has a binding score (threshold value) of 1% which is 
similar to the binding score of an epitope from influenza haemagglutinin (307-319) that 
was shown to be a strong DRB1*0401 binder (Table 1) (Rothbard, J.B. et al., Cell 52 (1988) 
515-523). 

GILT is constitutively expressed in antigen-presenting cells, such as dendritic cells, 
macrophages and B cells, and facilitates unfolding of endocytosed antigens in MHC class 
Il-contarning compartments (MIIC) by enzymatically reducing disulfide bonds (Phan, 
U.T. et al., J Biol Chem 275 (2000) 25907-25914). Direct binding of GILT to HLA-DR 
molecules has been reported for B cells (Arunachalam, B. et al., J Immunol 160 (1998) 
5797-5806). A rather long second epitope of GILT was found to bind to HLA-DR3 
molecules: the 22-mer GILT (38-59) having the amino acid sequence 
SPLQALDFFGNGPPVNYKTGNL (Chicz, R.M. et al., J Exp Med 178 (1993) 27-47). 

In addition to GILT (192-207), another epitope of the same protein was identified in 
several RA samples, but also in control samples: GELT (210-227) with the amino acid 
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sequence QPPHEl'VPWVTVNGKPLE. This epitope was accompanied by 3 other length 
variants: the 16-mer GILT (210-225), the 17-mer GILT (210-226) and the 19-mer GILT 
(210-228). 

As indicated by its name, GILT e3q)ression can be induced by the pro-inflammatory 
cytokine interferon gamma (IFN-y) in various types of cells, including macrophages, 
endothelial cells and fibroblasts (Luster, A.D. et al., J Biol Chem 263 (1988) 12036-12043). 
As IFN-Y is known to be present in inflamed joints of RA patients, GILT could become 
over-expressed in synovia and serum and, hence, could be taken up by DCs as an 
exogenous antigen. GILT (192-207) maybe derived fi:om exogenous GILT. The other GILT 
epitope, which is also present in the control samples, may be derived from endogenous 
GILT, expressed by DCs. Alternatively, both GILT (192-207) and GILT (210-227) may be 
derived from endogenous GILT, in case that GILT processing and GILT-derived epitope 
presentation by DCs were critically altered upon contact with RA-associated material. 

The identified epitope of Interferon-gamma-inducible lysosomal thiol reductase 
GILT (192-205) which has been described already in detail, was fiirther analysed in an 
in vitro binding assay using synthetic GILT (192-205) peptide and purified HLA-DR4 
molecules (Fig. 3): In agreement with TEPITOPE scoring, the peptide was shown to bind to 
HLA-DR4 with high afSnity, comparable to the viral HA( 307-319) peptide. 



Integrin beta-2 

The extended analysis revealed the presence of an epitope which was identified in two 
out of seven non-erosive RA samples from serum and synovia and which is derived from 
the beta subunit of Integrin (ITB2): the 17-mer ITB2 (315-331; SEQ. ID NO: 58) with the 
amino acid sequence NIQPIFAVTSRMVKTYE (Table 1). One length variant was found: 
the 19-mer ITB2 (313-331; SEQ. ID NO: 59) supporting the validity of the identified 
epitope (Table 1). 

The peptide sequence contains a moderate HLA-DRB1'*^0401 binding motif with 3161 
and 3191 serving as PI and P4 anchor, respectively (TEPITOPE binding score: 2%). 

Integrins are a family of cell sur£ice receptors that play important roles in 
embryogenesis, wound healing, immune responses, and cell adhesion. ITB2 is a 
heterodimeric receptor for intercellular and vascular adhesion molecules (ICAMs and 
VCAMs) and exclusively expressed on leukocytes. ICAMs and VCAMs are members of the 
immunoglobulin superfamily that has a central function in humoral and cell-mediated 
immune responses. Many cytokines, such as interferon- Y> IL-1 and TNFa, which are 
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upregiilated in inflammatory diseases like RA, induce ejqpression of ICAMs on the surface 
of endothelial cells. It was shown that the expression of ICAM-1 and VCAM-1 is higher in 
synovial tissue of RA patients as compared to osteoarthritis patients (Furuzawa-CarbaUeda, 
J. et al., Scand J Immunol 50 (1999) 215-222). 

Binding of ITB2 and other integrins to ICAMs allows leukocytes to infiltrate into 
their target tissues, e.g. the sites of inflammation. In order to function in a non-adherent or 
circulating mode, leukocytes constitutivdy express ITB2 and other integrins with low 
Ugand-binding capacity. The ITB2/ICAM-1 interaction has become an important strategic 
target to approach inflammation, autoimmune diseases and cancer (Yusuf-Makagiansar, 
H. et al., Med Res Rev 22 (2002) 146-167), hence these findings strongly support the 
validity of the identified 1X62 epitope as a candidate marker for RA. 



Phosphatidylinositol-4,5-bisphosphate 3-kinase 

Another epitope with a moderate HLA-DRBl'^0401 binding motif was identified in 
two non-erosive senmi samples: the 17-mer PI3K (792-808; SEQ. ID NO: 60) with the 
amino acid sequence NKVFGEDSVGVIFKNGD derived from Phosphatidylinositol 3- 
kinase (PI3K ) (Table 1). In this peptide 794V could serve as a hydrophobic PI anchor, 
797E as a negatively charged P4 anchor, and 799S as a typical DR4-P6 anchor (TEPITOPE 
binding score: 3%). 

PI3K is ubiquitously expressed in many cell types and phosphorjdates Upids, 
predominantiy phosphatidylinositol-4,5-bisphosphate. The protein has emerged as a key 
signal transducer for survival factor receptors, including growth factors, cytokines, and 
integrins (reviewed by Toker, A. & Cantiey, L., Nature 387 (1997) 673-676). In addition, 
PI3K plays an important role in the signaling pathway of Toll-like receptors (TLRs) that 
recognize a variety of microbial products, collectively termed pathogen-associated 
molecular patterns (PAMPs) (Fukao, T. 8c Koyasu, S., Trends Inmiunol 24 (2003) 358- 
363). Stimulation through TLRs by PAMPs, such as hpopolysaccharide (LPS) (endotoxin) 
triggers production of various cytokines, including IL-12, which is a key cytokine in TLR- 
mediated Thl responses. It was shown that PI3K is an endogenous suppressor of TLR- 
mediated IL-12 production and limits excessive Thl polarization. Thus PI3K was suggested 
to be a negative regulator of innate immune responses in order to prevent prolonged 
activation of innate immunity harmful to the host. The knowledge that autoimmune 
responses in RA rdy on Thl cytokines, ultimately links PI3K witii RA. Although no specific 
pathogen has been consistentiy linked to the development of RA, it is believed that RA 
develops in two stages in which an initial response induced by foreign antigens. 
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subsequently develops into a self-sustaining autoimmune response (Kliiunan, D., Arthritis 
Rheum 48 (2003) 590-593). 



Urokinase-type plasminogen activator 

Another epitope which was only found in non-erosive RA samples was derived from 
a urokinase-type plasminogen activator (uPA): the 16-mer uPA (328-343; SEQ. ID NO: 61) 
with the amino acid sequence YPEQLKMTVVKLISHR (Table 1). With regard to HLA- 
DRB1'*^0401 binding, the sequence reveals a moderate binding motifc 332L, 335T and 337V 
are putative PI, P4 and P6 anchors, respectively (TEPITOPE binding score: 2%). 

Plasminogen activators (PAs) are highly specific serine proteases generating plasmin 
fi:om plasminogen. Two types of PAs, a urokinase-type (uPA) and a tissue-type (tPA) have 
been identified in mammals. The activities of PAs are controlled by natural inhibitors 
(PAIs). Plasmin is involved in inflammatory reactions by inducing cytokines such as TGpp, 
and in cartilage and fibrin degradation. Evidences for ongoing coagulation within the 
rhexraiatoid joint together with an enhancement of uPA synthesis and activation in 
arthritic joints have led to the hypothesis that the PA/plasmin system is associated with the 
clinical severity of arthritis (reviewed by Busso, N. & Hamilton, J.A., Arthritis Rheum 46 
(2002) 2268-2279). Based on in vitro studies, there are several cell types present in the 
inflamed joints of RA patients, in particular monocytes and synovial fibroblasts which can 
e3q>ress PAs and PAIs and which therefore could contribute to the in vivo levels. The 
activity of PAs (i.e. uPA) is stimulated by cytokines, such as IL-1 and TNFa, which are 
known to be highly upregulated in serum as well as in synovial fluid of RA patients. These 
lines of evidence render uPA (328-343) a putative peptide marker for RA 



Immunglobulin heavy chain V-III region (Vh26) 

One of the identified epitopes that display a very strong binding motif with regard to 
HLA'DRB1*0401 binding, is derived fi:om the Vh26 gene segment of the immunglobulin 
heavy chain: the 16-mer Vh26 (95-110; SEQ. ID NO: 62) with the amino acid sequence 
KNTLYLQMNSLRAEDT (Table 1). The high TEPTTOPE binding score (1%) is 
substantiated by 99Y fionctioning as a very potent PI anchor, 102M as a moderate P4 
anchor and 104S ais a typical HLA-DR4 P6 anchor. 

Immunglobulins (Ig) are responsible for antigen binding and stimulate further 
immune reactions, e.g. by binding to isotope-spedfic Fc receptors- Interestingly, the 
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human Vh26 gene segment appears to encode a high-avidity synovial rheumatoid factor 
and thus seems to have an impact on RA progression (Wong, A. et al.. Autoimmunity 20 
(1995) 191-199). The fact that the identified epitope Vh26 (95-110) was foimd in the serum 
of two seropositive RA patients (Table 1) correlates with the above observation. 



DJ-1 protein 

In two non-erosive RA samples our continued studies revealed the presence of an 
epitope which is derived from a protein termed DJ-1: the 16-mer DJ-1 (135-150; SEQ. ID 
NO: 63) v^th the amino acid sequence NGGHYTYSENRVEKDG (Table 1). According to 
TEPITOPE scoring die peptide contains a rather weak HLA-DRBl *0401 binding motif with 
139Y serving as a PI anchor, 142S as a P4 anchor and 144N as a possible P6 anchor 
(binding score: 8%). 

DJ-1 belongs to the ThiJ/PfpI protein family whose members are evolutionary 
distributed from Archaea to Eukarya. ThiJ/PfpI proteins share a conserved ThiJ domain 
that is structurally related to the type I glutamine amidotransferase domain (Lee, SJ. et al., 
J Biol Chem 25 (2003) Epub ahead of print). 

DJ-1, which is preferentially expressed in testis and moderately in other tissues, was 
first identified as a novel candidate of the oncogene product that transformed mouse 
NIH3T3 cells in cooperation with ros. In the mean time additional physiological roles of 
DJ-1 were revealed including a strong correlation to Parkinson disease and sperm 
fertilization. DJ-1 seems to have multiple frmctions — here we provide the first indication 
for an association of DJ-1 with RA. 



Example 2 

In this example, the same technology was used that has been described in detail in 
example 1. Serum (6 samples) and synovial fluid (2 samples) of patients with diagnosed 
erosive RA were utilized in this case to identify candidate markers specific for erosive RA. 

The peptide sequences found in the erosive RA samples were compared with the 
sequences identified in unpulsed DCs (control 1) and in DCs pulsed with the serum of 
healthy test persons (control 2). Among the RA-spedfic sequences, only those peptides 
were selected for fiirther evaluation that re-occurred in at least three of six erosive RA 
samples. 



wo 2005/014622 PCT/EP2004/008609 

46 

In this study one epitope was discovered which occurred, apart from one exception, 
only in erosive RA samples. 



Apolipoprotein B-lOO 

The epitope which was mainly found in erosive RA sera (4 out of 8 erosive RA 
samples) is derived from apolipoprotein B-lOO: the 16-mer ApoB (2877-2892) with the 
amino acid sequence of SEQ ID NO: 4 (Table 2). In addition a length variant of the same 
epitope was identified (Table 2): the I7-mer ApoB (2877-2893; SEQ ID NO: 5). The 
following binding motif can be predicted: 2881L as a PI anchor, 2884D as a P4 

anchor and 2886N as a F6 anchor (binding score 3%). 

In an earlier study on EBV-B cells, the epitope ApoB (2885-2900), which partly 
overlaps with the epitope described here, has been found in the context of HLA-DR4 
(Chicz, R.M- et al., J Exp Med 178 (1993) 27-47). 

ApoUpoprotein B-lOO is a constituent of very low-density Upoproteins (VLDL) and 
low-density Upoproteins (LDL) and frmctions as a recognition signal for the cellular 
binding and internalization of LDL particles by the ApoB/E receptor (Yang, C.Y. et al.. 
Nature 323 (1986) 738-742). Interestingly, an increased ratio of LDL cholesterol to HDL 
cholesterol was observed among newly diagnosed RA patients (Park, Y.B. et al., J 
Rheumatol 26 (1999) 1701-1704). The adverse lipid profile in active RA could be improved 
by treating RA patients with DMARDs without the use of lipid-lowering agents (Park, Y.B. 
et al., Am J Med 113 (2002) 188-193). Since an increased cardiovascular mortality among 
patients with chronic inflammatory diseases, such as RA, is well documented (Symmons, 
D.P, et al., J Rheumatol 25 (1998) 1072-1077) it was suggested that local inflammation in 
RA leads to altered blood lipid levels, thereby increasing the risk of atherosclerosis. The 
question whether components of the lipoprotein metabolism are causal for pathogenesis or 
merely affected by ongoing immune reactions during RA development cannot be answered 
yet. However, the observation of adverse lipid profiles in RA patients supports the validity 
of the presented ApoB epitope as a serum-derived RA candidate marker. 

The length variant ApoB (2877-2892), but not ApoB (2877-2893), has been identified 
in samples of two healthy controls (Table 2). Since Apolipoprotein B constitutes 1% of all 
plasma proteins, the presence of ApoB epitopes in healthy control samples is not 
surprising. The results suggest that only the length variant ApoB (2877-2893; SEQ ID NO; 
5) is specific for erosive RA. 

268 proteasome non- ATPase regulatory subunit 8 
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An epitope which was quite frequently identified in mostly erosive RA samples both 
firom serum and synovia, is derived from the regulatory subunit 8 of the 26S proteasome 
(PSMD8): the 15-mer PSMD8 (218-232; SEQ. ID NO: 64) with the amino acid sequence 
GPNNYYSFASQQQKP (Table 2). Three additional length variants were identified: the 16- 
mer PSMD8 (218-233; SEQ. ID NO: 65), the 17-mer PSMD8 (218-234; SEQ. ID NO: 66) 
and the 18-mer PSMD8 (218-234; SEQ. ID NO: 67) (Table 2). The presence of length 
variants supports the validity of the identified epitope as a class II MHC-derived antigenic 
peptide. The peptide displays a moderate binding motif with respect to DRBl "^0401 binding 
(TEPITOPE binding score: 3%). Binding to HLA-DR4 could be confirmed in an in vitro 
binding assay using synthetic PSMD8 (218-233) peptide and purified HLA-DR4 molecules 
(Fig. 3): According to its IC50 value against the reporter peptide HA(308-319), PSMD8 
(218-233; SEQ. ID NO: 65) binds to HLA-DR4 with a moderate afBnity, confirming the 
TEPITOPE prediction. The 15-mer PSMD8 (218-232; SEQ. ID NO: 64) was identified once 
in an impulsed control sample. 

The proteasome accouts for about 1% of the cytoplasmic protein pool and is 
involved in ATP-dependent degradation of ubiquitinated proteins. Moreover, the 
proteasome is responsible for the processing of several transcription factors (i.e. nuclear 
fector-KB), for cell cyde control and the generation of MHC class I restricted antigens. The 
regulatory subunits of the proteasome are important for the selectivity of protein 
degradation. The non-ATPase regulatory subunit 8, in particular, is known to be necessary 
for the activation of the cell division control protein 28 (Cdc28), an essential cell cycle 
regulator in yeast. 

Importantly, strongly increased levels of circulating proteasomes (cProteasomes) 
were detected in serum samples obtained from patients with different systemic 
autoimmime diseases, including RA (Egerer, K. et al., J Reiunatol 29 (2002) 2045-2052). 
There appears to be a close correlation between the levels of cProteasome and disease 
activity and concentrations of C-reactive protein in patients with severe RA. cProteasome is 
discussed to trigger subsequent immune responses, which would indicate an antigen- 
driven mechanism. The concentration of released Proteasome antigen seems to reflect the 
magnitude of cellular damage in autoimmune diseases. From their findings Egerer and 
coworkers concluded that cProteasomes covdd represent a novel marker for disease severity 
in autoimmune processes. Since the epitope PSMD8 (218-232; SEQ. ID NO: 64) was 
mainly identified in erosive RA samples our analysis supports this conclusion. 



Interleukin-1 receptor 
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Another candidate marker for erosive RA is the Interleukin-1 receptor (IL-IR) which 
was identified in two erosive serum samples via its peptide IL-IR (79-94; SEQ. ID NO: 68) 
with the amino acid sequence EKLWFVPAKVEDSGHY (Table 2). IL-IR (79-94; SEQ. ID 
NO: 68), with 83F as a PI anchor, 86A as a P4 anchor and 88V as a P6 anchor, contains a 
strong binding motif with regard to binding to the RA susceptibility allele DRBl'^0401 
(TEPrrOPE binding score: 1%). In an in vitro binding assay the synthetic peptide was 
shown to bind to HLA-DR4 molecules with high affinity (Fig. 3), similar to HA(309-319) 
peptide. 

Interleukin-1 (IL-1) is a proinflammatory cytokine and implicated in a variety of 
infectious immime responses as well as in RA and other inflammatory diseases (DinareUo, 
C, Blood 87 (1996) 2095-2147). It binds to its respective receptor which functions as a 
signal transducer to trigger ceU proliferation or to stimulate protein synthesis. Increased IL- 
1 production has been observed in RA patients and plays a pivotal role in its clinical 
manifestations (Dayer, J.M„ Rheumatology 42 (2003) ii3-'iil9). IL-1 is viewed as a key 
mediator in RA through activation of macrophages and T- and B- lymphocytes. In 
addition, IL-1 contributes to inflammation by inducing the expression of cell-adhesion 
molecules, other cytokines and chemokines. Furthermore IL-1 is also pivotal in the 
destmction of bone and cartilage in RA by stimulating the production of matrix 
metalloproteinases. Thiis, the IL-l/IL-lR complex is a first class target for anti- 
inflammatory therapeutics. The use of a recombinant human IL-1 receptor antagonist (IL- 
IRA) has been approved for the treatment of RA patients. 



Fibromodulin 

Three erosive RA samples gave rise to an epitope derived from the secreted matrix 
protein fibromodulin (FM): the 13-mer FM (178-190; SEQ. ID NO: 70) with flie amino 
acid sequence LRELHLDHNQISR (Table 2). Furthermore, the 14-mer FM (177-190; SEQ. 
ID NO: 69) was identified, which was also foimd once in an unpulsed control experiment. 
The epitope depicts a strong DRBl'^0401 binding motif with 181L serving as a PI anchor, 
184D as a P4 anchor and 186N as a P6 anchor (TEPITOPE binding score: 1%). 

Fibromodulin belongs to a fanuly of small leucine-rich proteoglycans (SLRPs) that 
bind to TGpps and collagens and other extracellular matrix molecules. In vitroy SLRPs were 
shown to regulate colls^en fibrillogenesis, a process essential in devdopment, tissue repair 
and metastasis. To better understand the function of SLRPs in vivoy SLRP-defident mice 
were generated and shown to develop a wide array of diseases (e.g. osteoporosis and 
osteoarthritis), most of them resulting primarily from an abnormal collagen fibrillogenesis 
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(Ameye, L. & Young, M.F., Glycobiology 12 (2002) 107R-116R). Since collagen formation 
and degradation is highly enhanced in the inflamed joints of RA patients, an increased level 
of fibromodulm can be envisaged. In accord with this consideration is the fact that the 
identified FM epitope was primarily foimd in RA samples of synovial origin (3 out of 4 
synovia samples). 



GM-CSF/IL-3/IL-5 receptor 

In 5 out of 8 samples from erosive RA patients an epitope from the p-chain of the 
multiple cytokine receptor CYRB could be identified: the 15-mer CYRB (359-373; SEQ. ID 
NO: 71) with the amino acid sequence ETMKMRYEHIDHTFE and its length variant, the 
17-mer CYRB (359-375; SEQ. ID NO: 72) (Table 2). In an in vitro binding assay the 
synthetic CYRB (359-375) peptide was shown to bind to HLA-DR4 molecules with 
moderate affinity (Fig. 3). This is in agreement with TEPITOPE which predicted a 
moderate peptide binding motif with respect to DRB1*0401 binding (TEPITOPE binding 
score: 3%). The epitope was clearly overrepresented in the RA samples since it was 
identified in the serum of only one healthy test person. 

The GM-CSF/IL-3/IL-5 receptor is a type I membrane protein and differentially 
es^ressed throughout the hematopoietic system (reviewed by Geijsen, N. at al.. Cytokine 
Growth Factor Rev 12 (2001) 19-25), Its ligands, IL-3 and GM-CSF, are secreted by CD4+ 
T cells and important stimuli for the formation of dendritic cells originating firom 
progenitor cells in the bone marrow. The critical role of dendritic cells in the initiation and 
development of an immune response suggests that they play a key role in the development 
of autoimmune inflammatory disorders, such as RA, by transporting autoantigen to the 
draining lymph node where DCs encoimter and prime naive T cells. The activity of GM- 
CSF has been linked to proinflammatory effects in RA supporting the identified receptor 
epitope CYRB (359-373; SEQ. ID NO: 71) as a putative candidate peptide marker for the 
diagnosis of RA. 



Sorting nexin 3 

Another epitope which appears to be indicative for erosive RA is derived firom a 
protein termed sorting nexin 3 (SNX3): the 16-mer SNX3 (142-157; SEQ. ID NO: 73) 
having the amino acid sequence HMFLQDEIIDKSYTPS (Table 2). In this epitope 144F, 
147D and 1491 covdd serve as PI, P4 and P6 anchors, respectively, in the peptide-binding 
groove of the RA susceptibility allele DKBl '*^0401 (TEPITOPE binding score: 2%). 
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Sorting nexins are a diverse group of celliilar trafficking proteins that share a 
common phospholipid-binding motif (reviewed by Worby, CA. & Dixon, J.E., Nat Rev 
Mol Cell Biol 3 (2002) 919-31). The ability of these proteins to bind specific phospholipids, 
as well as their propensity to form protein-protein complexes, points to the involvement of 
these proteins in membrane trafficking and protein sorting. Sorting nexin 3 in particular is 
present in the cytosol and in endosomes and appears to be involved in membrane 
trafficking from early to recycling endosomes. Whether sorting nexin 3 plays a role in RA, 
e.g. by influencing antigen presentation routes, remains unknown at the moment. The 
epitope SNX3 (142-157; SEQ. ID NO: 73) identified in this study suggests a link between 
sorting nexins and autoimmunity. 



Example 3 

All peptide sequences identified in examples 1 and 2 firom non-erosive and erosive 
RA samples were used in this example to search for common markers relevant for both RA 
types. The RA-specific sequences were again compared with peptide sequences of the 
control samples (unpulsed DCs and DCs pulsed with the serum of two healthy test 
persons) and only those peptides were selected for fiirther evaluation that re-occurred in at 
least three of altogether fifteen RA samples (erosive and non-erosive RA). 



Inter-alpha-trypsin inhibitor 

Ten out of eleven serum samples (erosive & non-erosive RA) gave rise to an epitope 
derived from the heavy chain H4 of the iQter-alpha- trypsin inhibitor: ITIH4 (271-287) with 
the amino acid sequence of SEQ ID NO: 8 (Table 3). Apart from this major length variant 
of the rnH4 epitope, six length variants of the same ITIH4 epitope could be identified 
(Table 3): the 19-mer ITIH4 (271-289; SEQ ID NO: 6), the 18-mer ITIH4 (271-288; SEQ 
ID NO: 7), the 16-mer ITIH4 (274-289; SEQ ID NO: 12), the 15-mer ITIH4 (273-287; SEQ 
ID NO: 10), the 15-mer (274-288; SEQ ID NO: 11) and the 14-mer ITIH4 (274-287; SEQ 
ID NO: 9). 

As judged firom the shortest length variant, ITIH4 (274-287), the epitope contains a 
very strong binding motif, with regard to binding to the RA susceptibility allele 
DRBl'^0401: 277F serves as a PI anchor, 280D as a P4 anchor and 2828 as a P6 anchor 
(binding score: 1%). 
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ITIH4 belongs to the Inter-alpha-inhibitor (lod) family which is a group of serum 
protease inhibitors that bind to hyaluronic acid (HA) and appear to be involved in acute- 
phase reactions (Salier, J.P. et aL, Biochemical Journal 315 (1996) 1-9). 

HA is a polysaccharide found in all tissues of the body, in particular, in loose 
connective tissue, e.g. joint fluid (Evered, D. & Whelan, J. eds.. The Biology of Hyaluronan, 
John Wiley & Sons (1989)). HA has an important structural function in cartilage and other 
tissues where it stabilizes the extracellular matrix by forming aggregates with proteoglycans. 
It has also been assigned important biological functions by regulating cellular activities via 
binding to cell surfece proteins, such as CD44 and ICAM-1 (Knudson, C.B. & Knudson, 
W„ FASEB J 7 (1993) 1233-1241; Hafl, C.L. et al., J Cett Biol 126 (1994) 575-588). RA is 
accompanied by a large increase in total HA in the joint fluid as well as in the serum, 
suggesting that circidating HA originates from rheumatoid joints (Engstrom-Laurent, A. et 
al., Scand J Clin Lab Invest 45 (1985) 497-504). 

Complexes of HA and some lal family members were observed in large amoimts in 
the synovial fluid of RA patients (Jessen, T.E. et al.. Biological Chemistry Hoppe-Seyler 375 
(1994) 521-526). The role of the lod-HA complex in inflamtmatory reactions nadght be to 
modify the CD44-HA interaction that mediates leukocyte activation and invasion (Isacke, 
CM. & Yarwood, H., Int J Biochem Cell Biol 34 (2002) 718-721). Additionally, synovial 
fluid of RA patients contains elevated levels of TSG-6, an anti-inflammatory ^ycoprotein 
and a member of the hyaladherin family of HA-binding proteins (Wisniewski, H.G. et al., J 
Immunol 151 (1993) 6593-6601). It has been shown that a complex of TSG-6 with led 
&mily members inhibits the activity of plasmin, a central molecule in the activation of 
inflammation-associated enzymes (Wisniewski, H.G. et al., J Immunol 156 (1996) 1609- 
1615). A regulation of plasmin activity by several acute-phase plasma proteins, namely 
TSG-6 and led famfly members, may prove to be important in RA, given the high contents 
of HA, TSG-6 and led feinuly members in synovial fluid of inflamed joints. 

This evidence, together with the identification of multiple length variants of the same 
epitope and a strong HLA-DR4 binding motif, convincingly support the vaKdity of the 
presented ITIH4 epitope as a sermn-derived RA candidate marker. 



Complement C4 

In eig^t out of eleven RA sera (erosive and non-erosive) tested, another dominant 
epitope was identified which is derived firom complement C4: the 15-mer C4 (1697-1711) 
with the amino acid sequence of SEQ ID NO: 13 (Table 3). Five additional length variants 
of the same epitope could be found (Table 3): the 12-mer C4 (1697-1708; SEQ ID NO: 18), 
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the 13-mer C4 (1698-1710; SEQ ID NO: 17), the 14.mer C4 (1697-1710; SEQ ID NO: 15), 
the 16-mer C4 (1697-1712; SEQ ID NO: 14) and the 18-mer C4 (1697-1714; SEQ ID NO: 
16). Moreover, the presented epitope displays a very strong DRBl'^0401 binding motif: 
1700Y as PI anchor, 1704D as P2 anchor and 1706N as P6 anchor (binding score: 1%). 

C4 which constitutes approximately 0.5% of plasma protein mass plays a critical role 
in the triggering of the central pathway of the complement system. The protein is 
synthesized as a single-chain precursor and, prior to secretion, is enzymatically cleaved to 
form a trimer of non-identical a-, p-, and y-chains. The identified epitope C4 (1697-1711) 
is located at the very C-terminus of the C4 ychain. The C4 a-chain is further 
proteolytically degraded by activated CI to form the C4a anaphylatoxin, which is a 
mediator of local inflammatory processes (Moon, K.E. et al., J Biol Chem 256 (1981) 8685- 
8692). 

In general, the complement cascade is involved in the induction and progression of 
inflammatory reactions and is a major defense system against various pathogenic agents, 
including bacteria, viruses and other antigens (Morgan, B-P., Methods Mol Biol 150 (2000) 
1-13). Inappropriate activation, however, can lead to tissue damage and manifestation of 
disease (Speth, C. et al., Wien Kim Wochenschr 111 (1999) 378-391). 

Activation of the complement system has been repeatedly impUcated in the 
pathogenesis of RA, based on studies showing increased levels of complement metabolites, 
including C4 and C4a, in plasma, synovial fluid and synovial tissue of RA patients 
(Neumann, E. et al.. Arthritis Rheum 46 (2002) 934-945). In addition collagen-induced 
arthritis (CIA) in mice is characterized by the presence of complement activation products 
(Linton, S.M. & Morgan, B.P., Mol Immunol 36 (1999) 905-914). CIA is prevented after 
treatment with anti-C5 monoclonal antibodies (Wang, Y. et al., PNAS 92 (1995) 8955- 
8959) or with soluble CRl, an inhibitor of the complement system, delivered by gene 
therapy (Dreja, H. et al.. Arthritis Rheum 43 (2000) 1698-1709), Activation of complement 
factors in joints is possibly induced by the presence of various inraiune complexes and it 
was hypothesized that stimulation of the innate immune system by infectious agents and 
cytokines may contribute to the initiation of RA (Friese, MA. et al., Oin Exp Immunol 121 
(2000) 406-414). 

Two of the six presented C4 epitopes, the 15- and the 18-mer, were also identified in 
healthy control samples (Table 3) indicating that only some length variants of this C4 
epitope are RA-specific, namely the antigenic peptides of SEQ ID NOs: 14, 15, 17, and 18. 



Complement C3 
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Another epitope that was found in erosive and non-erosive RA samples is derived 
from complement C3 (alpha-chain): the 14-mer C3 (1431-1444) with the amino acid 
sequence of SEQ ID NO: 21 (Table 3). Six additional length variants of the same epitope 
were identified in serum (Table 3): the 13-mer C3 (1431-1443; SEQ ID NO: 23), the 14- 
mer C3 (1429-1442; SEQ ID NO: 74), the 15-mer C3 (1431-1445; SEQ ID NO: 22), the 15- 
mer C3 (1429-1443; SEQ ID NO: 20), the 17-mer C3 (1427-1443; SEQ ID NO:75) and the 
19-mer C3 (1426-1444; SEQ ID NO: 19). As judged from the shortest length variant, C3 
(1431-1443), a DRB1'*^0401 binding motif can be postulated: 1434Y serves as a PI anchor, 
1437D as a P4 anchor and 1439A as a P6 anchor. 

In erovise and non-erosive RA samples a further epitope was found, which is derived 
from complement C3 (beta-chain): the 19 mer C3 (157-175) with the amino acid sequence 
of SEQ. ID NO: 76 (Table 3). One additional length variant of the same epitope was 
identified in serum (Table 3): 20-mer C3 (157-176; SEQ. ID NO: 77) 

Complement C3 which constitutes about 1-2% of plasma protein mass plays a central 
role in the activation of the complement system and belongs to the £unily of the acute- 
phase proteins. Its processing by C3 convertase to C3a anaph^datoxin and C3b is the central 
step in both the classical and alternative complement pathways (Barrington, R. et al., 
Immunol Rev 180 (2001) 5-15). After activation, C3b can bind covalently, via a reactive 
thiolester, to cell sur&ce carbohydrates or inamime aggregates (Isaac, L. & Isenman, D.E., J 
Biol Chem 267 (1992) 10062-10069). The identified epitope C3 (1431-1444) is located at 
the C-terminus of C3b. 

As already discussed in the context of complement epitope C4 (1697-1711), there is 
increasing evidence for an important role of components of the complement cascade in the 
pathophysiology of RA. The result of this study, in which two major epitopes derived from 
complement C3 and C4 were identified in serum of RA patients, underlines the dose link 
between the activated complement system and pathogenesis of RA. This coincidence makes 
a strong argument for the vaUdity of the presented C3 / C4 epitopes as serum-derived 
candidate RA markers. 



SH3 domain-binding ^utamic acid-rich-like protein 3 

Another epitope which was elucidated quite frequently in senmi of RA patients (six 
out of deven erosive and non-erosive RA samples), is derived from the SH3 domain- 
binding glutamic add-rich-like protein 3 (SH3BGRL3): SH3BGRL3 (15-26) with the 
amiao add sequence of SEQ ID NO: 25 (Table 3). Three length variants of the same 
epitope were identified (Table 3): the 14-mer SH3BGRL3 (13-26; SEQ ID NO: 26), the 14- 
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mer SH3BGRL3 (15-28; SEQ ID NO: 27) and the 16-mer SH3BGRL3 (13-28; SEQ ID NO: 
24). The DRBl'^0401 binding motif is: 171 as Pi anchor, 20Q as P4 anchor and 22S as P6 
anchor (binding score 4%). 

SH3BGRL3 is a small 10 kD protein that belongs to the SH3BGR family. The precise 
function of the protein is miknown but a role as a modulator of glutaredoxin biological 
activity is postulated (Mazzocco, M. et al., Biochem Biophys Res Conunim 285 (2001) 540- 
545). So far, SH3BGRL3 has not been described in the context of RA. 

Interestingly, the analysis elucidated a second epitope of the same protein, which was 
highly abundant in all RA and control samples: the 16-mer SH3BGRL3 (29-44) with the 
amino acid sequence DGKRIQYQLVDISQDN. In addition multiple length variants of the 
same epitope were foimd in most samples as well. As judged from the shortest length 
variant, SH3BGRL3 (31-42), the epitope contains almost similar DRBl'^0401 anchor 
residues compared with SH3BGRL3 (15-26): 331 serves as a PI anchor, 36Q as a P4 anchor 
and 38V as a P6 anchor (binding score -2). This similarity is reflected by comparable 
binding scores. 

The presence of this second SH3BGRL3 epitope supports the vaHdity of the 
SH3BGRL3 (15-26) epitope because both peptides are derived from the same protein, 
however, only one of them, epitope SH3BGRL3 (15-26), appears to be generated in a RA- 
specific manner. A similar observation has been described already for GILT in example 1. 

Among the four SH3BGRL3 length variants the longest variant, SH3BGRL3 (13-28), 
was also identified in a healthy control sample (Table 3). However, this particular length 
variant was found only one time, which indicates a significant enrichment of the 
SH3BGRL3 epitope in the context of RA. 



Interleukin-4 (IL-4) induced protein 1 

In all the investigated synovial fluids (erosive & non-erosive RA) and in eight out of 
eleven sera (erosive & non-erosive RA), one highly dominant epitope was identified which 
is derived from the human homolog of the IL-4 induced protein 1 (Figl): Figl (293-309) 
with the amino acid sequence of SEQ ID NO: 28 (Table 3). The vaHdity of the epitope was 
ftirther supported by the presence of additional length variants in several samples (Table 3): 
the 16-mer Figl (293-308; SEQ ID NO: 30) and the 19-mer Figl (293-311; SEQ ID NO: 
29). Moreover, the amiao acid sequence displays a typical DRjBJ*040J binding motif. 299V 
serves as PI anchor, 302E as P4 anchor and 304S as P6 anchor (binding score 1%). 
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Two length variants of the same epitope, Figl (293-308) and Figl (293-309), were 
identified in one unpulsed sample and in one healthy control sample as well (Table 3). 
However, the presence of the Figl epitope in almost all RA samples but not in all of the 
control samples tested strongly indicates an enrichment in Ae context of RA. 

The himian^^^i gene was first identified in IL-4-stimulated B cell cultures (Chu, C.C. 
& Paul, W.E., PNAS 94 (1997) 2507-2512). The himian figl resides on chromosome 
19ql3.3-19ql3.4, a region previously identified to be involved in susceptibility to 
autoimmune diseases, including SLE, arthritis, multiple sclerosis, and insulin-dependent 
diabetes meUitus (Becker K.G. et al., PNAS 95 (1998) 9979-9984). Since its expression is 
largely Umited to immune tissues and its regulation is dependent on IL-4, a key modulator 
of the immune response, figl is thus an attractive candidate gene for autoinunune disease 
susceptibihty (Chavan, S.S. et al., Biochim Biophys Acta 1576 (2002) 70-80). The HLA- 
DR4-restricted presentation of a Figl epitope provides the first indication that Figl protein 
is produced and possibly involved in the disease development of RA The Figl polypeptide 
has not been known as a marker for RA until now, and is considered as an important 
candidate marker for RA. 



Hemopexin 

Another RA candidate marker which was firequently identified in serum samples (ten 
out of eleven samples) and in synovia samples (two out of four samples) (erosive & non- 
erosive RA) is derived from hemopexin (HPX): HPX (351-367) widi the amino add 
sequence of SEQ ID NO: 32 (Table 3). Several length variants were found which support 
the vaUdity of this epitope (Table 3): the 13-mer HPX (351-363; SEQ ID NO: 33), the 14- 
mer HPX (350-363; SEQ ID NO: 34), the 15-mer HPX (351-365; SEQ ID NO: 35), the 18- 
mer HPX (351-368; SEQ ID NO: 31) and the 18-mer HPX (350-367; SEQ ID NO: 78). 
Furthermore, the epitope contains a very strong DRBl'^0401 binding motif: 3551 serves as a 
PI anchor, 358D as a P4 anchor and 360V as a P6 anchor (binding score: 1%). 

Two length variants of the same epitope, HPX (351-367; SEQ ID NO: 32) and HPX 
(351-365; SEQ ID NO: 35), could also be identified in healthy control samples (Table 3) 
indicating that only some length variants are specific for RA, namely the antigenic peptides 
of SEQ ID NOs, 31, 33, 34 and 78. 

HPX is a 60 kD plasma glycoprotein with a high binding affinity to heme (MiiUer- 
Eberhard, U., Methods Enzymol 163 (1988) 536-565). It is mainly expressed in the liver, 
and belongs to the acute-phase proteins the synthesis of which is induced in an 
inflammatory situation. RA is a chronic inflanmiatory autoinumme disease and elevated 
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levels of several acute-phase proteins, including C-reactive protein and serum amyloid A, 
have been reported (Nakamura, R., J Clin Lab Anal 14 (2000) 305-313). HPX is responsive 
to the cytokines IL-1 and IL-6, which are upregulated in patients suffering from RA 
(Feldmann, M. & Maini, R.N., Rheumatology 38, Suppl 2 (1999) 3-7). 

HPX is the major vehicle for the transportation of heme in the plasma and its 
principal role is to prevent heme-mediated oxidative stress and loss of heme-bound iron 
(Tolosano, E. & Altruda, F., DNA Cell Biol 21 (2002) 297-306). It can protect cells against 
oxidative stress by inducing the expression of intracellular antioxidants such as heme 
oxygenase, metallothioneins and ferritin. Metallothioneins are cytosoHc proteins that are 
expressed particularly in synovial fibroblasts (Backman, J.T. et al., Virchows Arch 433 
(1998) 153-160). There is significant experimental evidence for the presence of oxidative 
stress in the synovial tissue of RA patients (reviewed in: Schett, G. et al.. Arthritis Res 3 
(2000) 80-86). Furthermore HPX was reported to promote proUferation of human T 
lymphocytes (Smith, A. et al., Exp Cell Res 232 (1997) 246-254). These studies render it 
likely that HPX belongs to the up-regulated proteins in serum and synovia of RA patients, 
thereby providing a rationale for the relevance of HPX (351-367) as a RA-specific candidate 
marker. 



Hsc70-interacting protein 

An epitope which was mostly identified in serum samples (4 out of eleven erosive and 
non-erosive RA samples) and which is also related to stress responses is derived from the 
Hsc70-interacting protein Hip: Hip (83-98) with the amino acid sequence of SEQ ID NO: 
38 (Table 3). Two length variants of this epitope were identified (Table 3): the 18-mer Hip 
(83-100; SEQ ID NO: 36) and the 15-mer Hip (84-98; SEQ ID NO: 39). An additional 
length variant was discovered in one erosive synovia sample (Table 3): the 15-mer Hip (85- 
99; SEQ ID NO: 37). As judged from the shortest length variant Hip (84-98) a DRB1''0401 
binding motif attaining a moderate score of 8%, can be postulated: 891 as PI anchor, 92D 
as P4 anchor, 94D as P6 anchor. 

In the cytosol of eukaryotic cells, Hip and Hop proteins associate with Hsc70 in order 
to participate in the regulation of Hsc70 chaperone activity (Frydman, J, & Hohfeld, J.> 
Trends Biochem Sd 22 (1997) 87-92). The 42 kD Hip protein binds to the ATPase domain 
of Hsc70. It was postulated that Hip might increase the half-life of the chaperone-substrate 
complex providing the molecular basis for an efficient cooperation of Hsc70 with 
downstream chaperone systems. Hsc70 and Hsp90 have been shown to cooperate during 
protein folding in vitro (Jakob, U, 8c Buchner, J., Trends Biochem Sd 19 (1994) 205-211; 



wo 2005/014622 



57 



PCT/EP2004/008609 



Freeman, B.C. & Morimoto, R.I., EMBO J 15 (1996) 2969-2979) and to play a role in 
thermal denaturation (Schneider, C. et al., PNAS 93 (1996) 14536-14541). The Hsc70 and 
Hsp90 association with stress-adaptation ultimately links Hip to stress responses, including 
the induction of heat shock proteins, in the synovial tissue of RA patients (reviewed in: 
Schett, G. et al.. Arthritis Res 3 (2001) 80-86). 



Bindmg properties of ITIH4, C4, C3, SH3BGRL3, Figl, HPX and ffip 

In order to further investigate the binding properties of the above described antigenic 
peptides of ITIH4, C4, C3, SH3BGRL3, Figl, HPX and Hip to HLA-DR4 molecules, in 
10 vitro binding assays were performed (Fig. 3): According to their IC50 values against the 
reporter peptide HA (309-317), the synthetic peptides ITIH4 (274-287), SH3BGRL3 (13- 
26), and Figl (293-309) bind to HLA-DR4 with high affinity (Fig. 3). Moderate binding to 
HLA-DR4 was measured for the synthetic peptides C4 (1696-1709) and HPX (351-365). 
The synthetic peptide C3 (1431-1444) boimd only weakly to HLA-DR4 which is in 
15 agreement with TEPITOPE scoring (9%). No binding was measured for the Hip (84-98) 
peptide (data not shown). 

Invariant chain (li) 

Among the candidate marker peptides that appear to be indicative for erosive and 
20 non-erosive RA is an epitope which is derived from the HLA-DR-assodated invariant chain 
(li): the 15-mer li (110-124; SEQ. ID NO: 83) with the amino add sequence 
ATPLLMQALPMGALP (Table 3). Moreover, four length variants were identified: the 16- 
mer fi (109-124; SEQ. ID NO: 81), the 17-mer li (109-125; SEQ. ID NO: 80), the 18-mer li 
(109-126; SEQ. ID NO: 79) and the 21-mer li (109-129; SEQ. ID NO: 82) (Table 3). The 
25 presence of length variants gives confidence in the identified peptide sequence which 
displays a moderate DRB1''0401 binding motif (TEPITOPE binding score: 5%) with 114L 
serving as a PI anchor, 117A as a P4 anchor and 119P as a P6 anchor. The epitope was 
identified in two erosive and three non-erosive RA samples. Two li length variants were 
found in one unpulsed sample and two samples of healthy test persons. 

30 Peptide loading of MHC class II molecules is regulated by two accessory molecules in 

the class 11 pathway, invariant chain li and HLA-DM (reviewed by Bakke, O. 8c Nordeng 
T.W., Immunol Rev 172 (1999) 171-187 and Kropshofer, H. et al., Inmiunol Today 18 
(1997) 77-82). li trimers bind to nascent MHC class 11 molecules in the rough endoplasmic 
reticulum (ER) and block the peptide-binding groove, stabilizing the class II molecule and 
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preventing the binding of ligands available in the ER. Class Il/Ii complexes are transported 
via the Golgi apparatus to endosomes, where li is degraded to a nested set of class II- 
associated li peptides (CLIP). The release of CLIP enables endosomal peptides to bind to 
the MHC class II moleciiles v\rhich are presented to T cell receptors on the ceU surface. The 
identified epitope li (110-124) overlaps with the C-terminal part of CLIP. Interestingly, a 
reduced interaction of CLIP with RA.-associated HLA-DR alleles was described (Patil, N.S. 
et alo J Immiinol 167 (2001) 7157-7168), indicating that a reduced interaction may 
contribute to the pathophysiology of autoimmtmity in RA. 



Retinoic acid receptor responder protein 2 

In 4 out of 11 serum samples (erosive & non-erosive RA) an epitope was identified 
which is derived firom retinoic add receptor responder protein 2 (RARRES2): the 22-mer 
RARRES2 (40-61; SEQ. ID NO: 86) v^th the amino acid sequence 
HPPVQWAFQETSVESAVDTPFP (Table 3). In the C-terminal part of the epitope two 
length variants could have been identified: the 23-mer RARRES2 (40-62; SEQ. ID NO: 84) 
and the 24-mer RARRES2 (40-63; SEQ. ID NO: 85) (Table 3). With 45W as PI anchor, 
48Q as P4 anchor and SOT as P6 anchor, the epitope displays a moderate binding motif in 
the context of DRBl'^0401 (TEPITOPE binding score: 3%). 

RARRES2 is a small 18.6 kD protein and mosdy expressed in the endothelium and 
epidermis. The expression appears to be hormone dependent and a response to retinoic 
acid in skin and some osteotrophic hormones in marrow-derived stromal cells was 
identified (Nagpal, S. et al., J Invest Dermatol 109 (1997) 91-95 and Adams, A.E. et al., J 
CeU Biochem 74 (1999) 587-595). The fimction of RARRES2 is largely unknown and an 
association with RA, for instance through impaired osteodastogenesis, is imaginable. 



Fibronectin 

Another HLA-DR4 associated peptide which was exclusively found in 3 out of 4 
synovia samples (erosive & non-erosive RA) is derived firom fibronectin (Fn), a major 
glycoprotein in blood plasma and in the extracellular matrix: the 15-mer Fn (1881-1895; 
SEQ. ID NO: 90) with the amino acid sequence lYLYTLNDNARSSPV (Table 3). The 
epitope appears to be a very good DRB1'*^0401 binder with 1885Y serving as a hydrophobic 
PI anchor, 1888N as a P4 anchor and 1890N as a P6 anchor (TEPITOPE binding score: 
1%). Four length variants were additionally identified: the 16-mer Fn (1881-1896; SEQ. ID 
NO: 91), the 16-mer Fn (1880-1895; SEQ. ID NO: 88), the 17-mer Fn (1881-1897; SEQ. ID 
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NO: 89) and the 17-mer Fn (1880-1896; SEQ. ID NO: 87), strongly supporting the validity 
of the identified Fn epitope (Table 3). 

Fibronectin plays an important role in cell adhesion, cell motflity and in 
opsonization. It binds coUagen and fibrin and mediates adhesion of fibroblasts to collagen 
fibrils. Fibronectin is strongly expressed in the synovial lining layer of both RA and 
osteoarthritis patients and correlates with hyperplasia of diseased joints. The identification 
of epitope Fn (1881-1895) in only samples of synovial origin is in line with these findings 
and points to a putative role of highly abundant fibronectin in autoimmunity. 



Cathepsin B 

Three out of fifteen RA samples (erosive & non-erosive RA) gave rise to an epitope 
which originates firom Cathepsin B (CatB): the 15-mer CatB (227-241; SEQ. ID NO: 92) 
with the amino acid sequence YNSYSVSNSEKDIMA (Table 3). The peptide displays a 
moderate DKBl'^0401 binding motif with 230Y as a hydrophobic PI anchor and the serine 
residues at positions 233 and 235 as putative P4 and P6 anchors (TEPITOPE binding score: 
6%). 

Cathepsin B is a thiol peptidase and beUeved to participate in intracellular 
degradation and turnover of proteins, e.g. collagen. It is located in lysosomes of different 
cell types including leukocytes. In the context of inflammation and disease it was shown 
that Cathepsin B contributes to cartilage destruction in osteoarthritis and pathological 
proteolysis in RA and cancer (Cuimane, G. et al.. Arthritis Rheum 44 (2001) 1744-1753). 
Enzyme activities of Cathepsin B and other lysosomal peptidases correlate with RA 
progression which supports the validity of epitope CatB (227-241) as a putative RA marker 
peptide (Sohar, N. et al., Biol Chem 383 (2002) 865-869). 



Tripeptidyl-peptidase II 

Another peptidase-derived epitope which was found in three out of fifteen RA 
samples (erosive & non-erosive RA) is derived from tripeptidyl-peptidase 11 (TPP2): tihie 
15-mer TPP2 (970-984; SEQ. ID NO: 93) having the amino add sequence 
AGSLTLSKTELGKKA (Table 3). Additionally, the length variant TPP2 (970-985; SEQ. ID 
NO: 94) was identified. The peptidase epitope contains a typical DRBl'^0401 binding motif 
with 973L, 976S and 978T as PI, P4 and P6 anchors respectively (TEPITOPE binding score: 
3%). 
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Similar to Cathepsin B, tripeptidyl-peptidase II is involved in lysosomal protein 
degradation. The identification of epitope TPP2 (970-984) in the context of RA provides 
the first indication that TPP2 might be implicated in impaired protein degradation in the 
context of inflammation and RA. 



Legumain 

Legumain (LGMN) completes the set of three peptidases which were foimd in this 
investigation. The identified epitope LGMN (99-112; SEQ. ID NO: 95) with the amino add 
sequence VPKDYTGEDVTPQN (Table 3) displays a typical binding motif with respect to 
the HLA allele DRBl'^CMOl in which 103Y could serve as a PI anchor, 106E as a P4 anchor 
and 108V as a P6 anchor (TEPITOPE binding score: 1%). 

Legumain occurs in endosomal and lysosomal fractions of antigen-presenting cells 
such as dendritic cells (Schwarz, G. et al., Biol Chem 383 (2002) 1813-1816). It has a strict 
specificity for the hydrolysis of asparaginyl bonds and was shown to play an important role 
in the processing of bacterial antigens for MHC class 11 presentation (Manoury, B. et al., 
Nature 396 (1998) 695-699). Whether Legumain is involved in autoimmune diseases, such 
as RA, remains unknown but the fact that three lysosomal peptidases together with GILT 
and the 26S proteasome were identified in this investigation suggests that the protein 
degradation machinery ^icilitating antigen processing might be significantly altered in the 
context of RA 



Platelet activating factor receptor 

An epitope which was identified in three out of eleven sermn RA samples (erosive 8c 
non-erosive RA) is derived from a receptor for platelet activating factor (PAFR): the 13- 
mer PAFR (264-276; SEQ. ID NO: 96) with the amino acid sequence DSKFHQAINDAHQ 
(Table 3). According to TEPITOPE scoring (3%) the epitope contains a moderated 
binding motif with 267F as PI anchor, 270A as P4 anchor and 272N as P6 
anchor (Table 3). 

Platelet activating factor (PAF) is a pro-inflanmiatory Upid mediator which binds to a 
G-protein-coupled seven transmembrane receptor on the sur&ce of a broad range of cell 
types. By receptor binding, PAF transduces pleiotrophic fimctions which indude cell 
motility, smooth muscle contraction, and synthesis and release of cytoldnes (reviewed by 
Honda, Z. et al., J Biochem 131 (2002) 773-779). Pharmacological studies and the 
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establishment of PAFR (-/-) mice have suggested that PAF functions in a variety of settings 
including allergy, inflanmiation, neural functions, reproduction, and atherosclerosis. 
Interestingly, PAF was found in elevated levels in the synovial fluid of RA patients and 
shown to induce neoangiogenesis, which is frequently observed in rheumatoid synovitis 
5 (Lupia, E. et al., Eur J Immimol 26 (1996) 1690-1694). This type of evidence gives further 
confidence in epitope PAFR (264-276) being a RA-assodated marker peptide. 



Poly-alpha-2,8-sialyltransferase 

Another epitope which was foxmd in three out of eleven serum samples (erosive & 
10 non-erosive RA) originates from Polysialyitransferase (PST): the 15-mer PST (333-347; 
SEQ. ID NO: 97), MPLEFKTLNVLHNRG (Table 3), displays a moderate bmding motif 
with respect to DRBl'^0401 binding (TEPITOPE scoring: 2%). 337F could serve as a PI 
anchor, 340L as a P4 anchor and 342V as a P6 anchor. 

PolysiaUc acid is a carbohydrate composed of a linear homopolymer of a-2,8-linked 
15 sialic acid residues. The glycan is mainly attached to the neural cdl adhesion molecule (N- 
CAM) and implicated in many morphogenic processes of the neural cdls by modulating 
the adhesive property of N-CAM. The membrane protein polysialyitransferase catalyzes the 
polycondensation of sialic add residues and is highly expressed in fetal brain, lung and 
kidney, in adult heart, spleen and thymus, and to a lesser extent in peripheral blood 
20 leukocytes (Nakayama, J. et al., Proc Natl Acad Sci 92 (1995) 7031-7035). Elevated PST 
levels were observed in serum of patients with metastatic tumors and an increased enzyme 
activity was also associated with rheumatoid arthritis (Barge, P.G. et al., Klin Wochenschr 
60 (1982) 445-449). The identification of epitope PST (333-347) in several RA samples is in 
line with these observations and supports the putative role of PST in RA. 

25 

Ras-related protein Rab-1 IB 

The last epitope presented in this analysis of RA samples is derived from the Ras- 
related protein Rab-1 IB: the 13-mer Rab-1 IB (51-63; SEQ. ID NO: 102) with the amino 
acid sequence RSIQVDGKTEKAQ (Table 3, the peptide sequence were vahdated with 
30 regard to binding to the RA susceptibility allele DjRBJ^0301 by using the TEPITOPE 
software (Hammer, J. et al., Adv Inmiunol 66 (1997) 67-100)). This epitope and four 
additional lenght variants, the 14-mer Rab-llB (50-63; SEQ. ID NO: 100), the 15-mer Rab- 
IIB (49-63; SEQ. ID NO: 98), the 17-mer Rab-llB (49-65; SEQ. ID NO: 101) and the 18- 
mer Rab-llB (49-66; SEQ. ID NO: 99) were found in three erosive and two non-erosive 
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RA samples (Table 3). The 15-iner Rab-llB (49-63) was identified in 1 out of 12 cx>ntrol 
samples: 

.Rab proteins are small GTPases that play an important role in membrane trafficking 
along the endo- and exoqrtic pathway. Rab-llB is assimied to be essential for the transport 
5 of intemalized transferrin firom the recycling compartment to the plasma membrane. 
Furthermore Rab-llB was identified in rat osteoclasts and might play an additional role ia 
bone resorption. The epitope Rab-llB (51-63) provides the first indication of a role of 
Rab-llB in RA development 
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